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WIGNER CHAOS AND THE FOURTH MOMENT 
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Universitdt des Saarlandes 

We prove that a normalized sequence of multiple Wigner inte- 
grals (in a fixed order of free Wigner chaos) converges in law to the 
standard semicircular distribution if and only if the corresponding 
sequence of fourth moments converges to 2, the fourth moment of 
the semicircular law. This extends to the free probabilistic, setting 
some recent results by Nualart and Peccati on characterizations of 
central limit theorems in a fixed order of Gaussian Wiener chaos. 
Our proof is combinatorial, analyzing the relevant noncrossing par- 
titions that control the moments of the integrals. We can also use 
these techniques to distinguish the first order of chaos from all others 
in terms of distributions; we then use tools from the free Malliavin 
calculus to give quantitative bounds on a distance between different 
orders of chaos. When applied to highly symmetric kernels, our re- 
sults yield a new transfer principle, connecting central limit theorems 
in free Wigner chaos to those in Gaussian Wiener chaos. We use this 
to prove a new free version of an important classical theorem, the 
Breuer-Major theorem. 

1. Introduction and background. Let (Wt)t>o be a standard one-dimen- 
sional Brownian motion, and fix an integer n > 1. For every deterministic 
(Lebesgue) square-integrable function / on M™ , we denote by I^{f) the nth 
(multiple) Wiener-Ito stochastic integral of / with respect to W (see, e.g., 
[17, 19, 27, 31] for definitions; here and in the sequel refers to the nonneg- 
ative half-line [0,oo)). Random variables such as '(/) play a fundamental 
role in modern stochastic analysis, the key fact being that every square- 
integrable functional of W can be uniquely written as an infinite orthogonal 
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sum of symmetric Wiener-Ito integrals of increasing orders. This feature, 
known as the Wiener-Ito chaos decomposition, yields an explicit represen- 
tation of the isomorphism between the space of square-integrable functionals 
of W and the symmetric Fock space associated with L 2 (]R + ). In particular, 
the Wiener chaos is the starting point of the powerful Malliavin calculus of 
variations and its many applications in theoretical and applied probability 
(see again [17, 27] for an introduction to these topics). We recall that the 
collection of all random variables of the type '(/), where n is a fixed in- 
teger, is customarily called the nth Wiener chaos associated with W . Note 
that the first Wiener chaos is just the Gaussian space spanned by W. 

The following result, proved in [29], yields a very surprising condition un- 
der which a sequence I^ifk) converges in distribution, as k — > oo , to a Gaus- 
sian random variable. [In this statement, we assume as given an underlying 
probability space (X, T, P), with the symbol E denoting expectation with 
respect to P.] 

Theorem 1.1 (Nualart, Peccati). Let n>2 be an integer, and let (fk)keN 
be a sequence of symmetric functions (cf. Definition 1.19 below) in L 2 (W^_), 
each with n!||/fc||z,2(K^) = 1- The following statements are equivalent: 

(1) The fourth moment of the stochastic integrals I^(fk) converge to 3. 

lim E(/r(/ fc ) 4 ) = 3. 

k— >oo 

(2) The random variables I^ifk) converge in distribution to the standard 
normal law N(0, 1). 

Note that the Wiener chaos of order n > 2 does not contain any Gaus- 
sian random variables, cf. [17], Chapter 6. Since the fourth moment of the 
normal N(0, 1) distribution is equal to 3, this Central Limit Theorem shows 
that, within a fixed order of chaos and as far as normal approximations are 
concerned, second and fourth moments alone control all higher moments of 
distributions. 

Remark 1.2. The Wiener isometry shows that the second moment 
of In(f) 1S equal to n!||/||| 2 , and so Theorem 1.1 could be stated intrinsi- 
cally in terms of random variables in a fixed order of Wiener chaos. Moreover, 
it could be stated with the a priori weaker assumption that (fk) 2 ) — > a 2 
for some a > 0, with the results then involving N(0, a 2 ) and fourth mo- 
ment 3d 4 , respectively. We choose to rescale to variance 1 throughout most 
of this paper. 

Theorem 1.1 represents a drastic simplification of the so-called "method 
of moments and cumulants" for normal approximations on a Gaussian space, 
as described, for example, in [20, 34]; for a detailed in-depth treatement of 
these techniques in the arena of Wiener chaos, see the forthcoming book [31]. 
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We refer the reader to the survey [23] and the forthcoming monograph [24] 
for an introduction to several applications of Theorem 1.1 and its many ram- 
ifications, including power variations of stochastic processes, limit theorems 
in homogeneous spaces, random matrices and polymer fluctuations. See in 
particular [22, 26, 28] for approaches to Theorem 1.1 based respectively on 
Malliavin calculus and Stein's method, as well as applications to universality 
results for nonlinear statistics of independent random variables. 

In the recent two decades, a new probability theory known as free probabil- 
ity has gained momentum due to its extremely powerful contributions both 
to its birth subject of operator algebras and to random matrix theory; see, for 
example, [1, 16, 21, 41]. Free probability theory offers a new kind of indepen- 
dence between random variables, free independence, that is, modeled on the 
free product of groups rather than tensor products; it turns out to succinctly 
describe the relationship between eigenvalues of large random matrices with 
independent entries. In free probability, the central limit distribution is the 
Wigner semicircular law [cf. equation (1.4)], further demonstrating the link 
to random matrices. Free Brownian motion, discussed in Section 1.2 below, 
is a (noncommutative) stochastic process whose increments are freely inde- 
pendent and have semicircular distributions. Essentially, one should think of 
free Brownian motion as Hermitian random matrix- valued Brownian motion 
in the limit as matrix dimension tends to infinity; see, for example, [7] for 
a detailed analysis of the related large deviations. 

If (St)t>o is a free Brownian motion, the construction of the Wiener- 
Ito integral can be mimicked to construct the so-called Wigner stochastic 
integral (cf. Section 1.3) Zf (/) of a deterministic function / £ L 2 (R"). The 
noncommutativity of St gives /„ different properties; in particular, it is no 
longer sufficient to restrict to the class of symmetric /. Nevertheless, there is 
an analogous theory of Wigner chaos detailed in [8] , including many of the 
powerful tools of Malliavin calculus in free form. The main theorem of the 
present paper is the following precise analog of the central limit Theorem 1.1 
in the free context. 

Theorem 1.3. Let n>2 be an integer, and let (fk)keN be a sequence 
of mirror symmetric functions (cf. Definition 1.19) in L 2 (M™) ; each with 
ll/fc||L 2 (R™) = 1- The following statements are equivalent: 

(1) The fourth moments of the Wigner stochastic integrals I^ifk) con- 
verge to 2. 

limE(/f(/ fc ) 4 )=2. 

fc— >oo 

(2) The random variables In(fk) converge in law to the standard semi- 
circular distribution S(0, 1) [cf. equation (1-4)] as k^-oo. 

Remark 1.4. The expectation E in Theorem 1.3(1) must be properly 
interpreted in the free context; in Section 1.1 we will discuss the right frame- 
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work (of a trace E = <p on the von Neumann algebra generated by the free 
Brownian motion). We will also make it clear what is meant by the law of 
a noncommutative random variable like I^(fk). 

Remark 1.5. Since the fourth moment of the standard semicircular dis- 
tribution is 2, (2) nominally implies (1) in Theorem 1.3 since convergence in 
distribution implies convergence of moments (modulo growth constraints); 
the main thrust of this paper is the remarkable reverse implication. The mir- 
ror symmetry condition on / is there merely to guarantee that the stochastic 
integral I^if) 1S indeed a self-adjoint operator; otherwise, it has no law to 
speak of (cf. Section 1.1). 

Our proof of Theorem 1.3 is through the method of moments which, in the 
context of the Wigner chaos, is elegantly formulated in terms of noncrossing 
pairings and partitions. While, on some level, the combinatorics of partitions 
can be seen to be involved in any central limit theorem, our present proof 
is markedly different from the form of the proofs given in [26, 28, 29]. All 
relevant technology is discussed in Sections 1.1-1.4 below; further details 
on the method of moments in free probability theory can be found in the 
book [21]. 

As a key step toward proving Theorem 1.3, but of independent interest 
and also completely analogous to the classical case, we prove the following 
characterization of the fourth moment condition in terms of standard in- 
tegral contraction operators on the kernels of the stochastic integrals (as 
discussed at length in Section 1.3 below). 

Theorem 1.6. Let n be a natural number, and let (fk)keN be a sequence 
of functions in L 2 (M"), each with HAHl 2 ^) = 1- The following statements 
are equivalent: 

(1) The fourth absolute moments of the stochastic integrals In(fk) con- 
verge to 2. 

limE(|/ n s (/ fc )| 4 )=2. 

fc— >-oo 

(2) All nontrivial contractions (cf. Definition 1.21) of fk converge to 0: 
for each p = 1,2, . . . , n — 1, 

lim /*A/* = znL 2 (4"~ 2p ). 

k— >oo 

While different orders of Wiener chaos have disjoint classes of laws, it is (at 
the present time) unknown if the same holds for the Wigner chaos. As a first 
result in this direction, the following important corollary to Theorem 1.6 
allows us to distinguish the laws of Wigner integrals in the first order of 
chaos from all higher orders. 
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COROLLARY 1.7. Let n>2 be an integer, and consider a nonzero mir- 
ror symmetric function f £ L 2 (M™). Then the Wigner integral I^if) satis- 
fies E[/^(/) 4 ] > 2E[/^(/) 2 ] 2 . In particular, the distribution of the Wigner 
integral Inif) cannot be semicircular. 

Combining these results with those in [22, 26, 28, 29], we can state the 
following Wiener-Wigner transfer principle for translating results between 
the classical and free chaoses. 

Theorem 1.8. Let n>2 be an integer, and let (fk)keN be a sequence 
of fully symmetric (cf. Definition 1.19) functions in L 2 {W]_). Let a > be 
a finite constant. Then, as k— >oo: 

(1) E[/^(/ fc ) 2 ] -> nla 2 if and only ifE[I% (f k ) 2 ] -> a 2 . 

(2) If the asymptotic relations in (1) are verified, then I^(fk) converges 
in law to a normal random variable N(0,n\a 2 ) if and only if converges 
in law to a semicircular random variable 5(0, a 2 ). 

Theorem 1.8 will be shown by combining Theorems 1.3 and 1.6 with the 
findings of [29]; the transfer principle allows us to easily prove yet unknown 
free versions of important classical results, such as the Breuer-Major theo- 
rem (Corollary 2.3 below). 

Remark 1.9. It is important to note that the transfer principle The- 
orem 1.8 requires the strong assumption that the kernels fk are fully sym- 
metric in both the classical and free cases. While this is no loss of generality 
in the Wiener chaos, it applies to only a small subspace of the Wigner chaos 
of orders 3 or higher. 

Corollary 1.7 shows that the semicircular law is not the law of any stochas- 
tic integral of order higher than 1. We are also able to prove some sharp 
quantitative estimates for the distance to the semicircular law. The key esti- 
mate, using Malliavin calculus, is as follows: it is a free probabilistic analog 
of [22], Theorem 3.1. We state it here in less generality than we prove it in 
Section 4.1. 

Theorem 1.10. Let S be a standard semicircular random variable [cf. 
equation (1-4)]- Let F have a finite Wigner chaos expansion; that is, F = 
En=i^n(/n) f or some mirror symmetric functions f n G L 2 (M") and some 
finite N. Let C2 and J?2 be as in Definition 3.16. Then 

d C2 (F,S) = sup \E[h(F)]-E[h(S)]\ 

h&C 2 

■y 2 {h)<i 

(1.1) 

< -E <g)E 
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The Malliavin calculus operators V and Nq and the product (J on tensor- 
product-valued biprocesses are defined below in Section 3, where we also 
describe all the relevant structure, including why the free Cameron-Gross- 
Malliavin derivative VtF of a random variable F takes values in the tensor 
product L 2 (R + ) ® L 2 (M_|_). The class C2 is somewhat smaller than the space 
of Lipschitz functions, and so the metric dc 2 on the left-hand side of equa- 
tion (4.1) is, a priori, weaker than the Wasserstein metric. This distance 
does metrize convergence in law, however. 

Remark 1.11. The key element in the proof of Theorem 1.10 is to mea- 
sure the distance between F and 5 by means of a procedure close to the 
so-called smart path method, as popular in Spin Glasses; cf. [36]. In this 
technique, one assumes that F and 5 are independent, and then assesses 
the distance between their laws by controlling the variations of the map- 
ping 1 1 y E[/i(t/1 — tF + yiS)] (where h is a suitable test function) over the 
interval [0, 1]. As shown below, our approach to the smart path method re- 
quires that we replace y/tS by a free Brownian motion St (cf. Section 1.2) 
freely independent from F, so that we can use the free stochastic calculus 
to proceed with our estimates. 

Using Theorem 1.10, we can prove the following sharp quantitative bound 
for the distance from any double Wigner integral to the semicircular law. 

Corollary 1.12. Let f £ L 2 (M.+) be mirror- symmetric and normalized 
ll/llL 2 (K n ) = 1j S be a standard semicircular random variable and let dc 2 
be defined as in equation (1.1). Then 



In principle, equation (1.1) could be used to give quantitative estimates 
like equation (1.2) for any order of Wigner chaos. However, the analogous 
techniques from the classical literature heavily rely on the full symmetry of 
the function /; in the more general mirror symmetric case required in the 
Wigner chaos, such estimates are, thus far, beyond our reach. 

The remainder of this paper is organized as follows. Sections 1.1 through 1.4 
give (concise) background and notation for the free probabilistic setting, free 
Brownian motion and its associated stochastic integral the Wigner integral 
and the relevant class of partitions (noncrossing pairings) that control mo- 
ments of these integrals. Section 2 is devoted to the proofs of Theorems 1.3 
and 1.6 along with Corollary 1.7 and Theorem 1.8. In Section 3, we collect 
and summarize all of the tools of free stochastic calculus and free Malli- 
avin calculus needed to prove the quantitative results of Section 4; this final 
section is devoted to the proofs of Theorem 1.10 (in Section 4.1) and Corol- 
lary 1.12 (in Section 4.2), along with an abstract list of equivalent forms 
of our central limit theorem in the second Wigner chaos. Finally, Appendix 
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contains the proof of Theorem 3.20, an important technical approximation 
tool needed for the proof of Theorem 1.10 but also of independent interest. 

1.1. Free probability. A noncommutative probability space is a complex 
linear algebra srf equipped with an involution (like the adjoint operation 
X \— > X* on matrices) and a unital linear functional (p: stf — > C. The stan- 
dard classical example is = L°°(£l, J 7 , P) where J 7 is a a- field of subset 
of Q, and P is a probability measure on T\ in this case the involution is 
complex conjugation and (p is expectation with respect to P. One can iden- 
tify J- from si through the idempotent elements which are the indicator 
functions lg of events E G J 7 , and so this terminology for a probability 
space contains the same information as the usual one. Another relevant ex- 
ample that is actually noncommutative is given by random matrices; here 

= L°°(yi, J 7 , P; M^(C)), d x <i-matrix-valued random variables, where the 
involution is matrix adjoint and the natural linear functional <p is given by 
tp(X) = ^ETr(X). Both of these examples only deal with bounded random 
variables, although this can be extended to random variables with finite 
moments without too much effort. 

The pair (L°°(il, J 7 , P),E) has a lot of analytic structure not present in 
many noncommutative probability spaces; we will need these analytic tools 
in much of the following work. We assume that s/ is a von Neumann algebra, 
an algebra of operators on a (separable) Hilbert space, closed under adjoint 
and weak convergence. Moreover, we assume that the linear functional ip is 
weakly continuous, positive [meaning <p(X) > whenever X is a nonnegative 
element of ; i.e., whenever X = YY* for some Y G faithful [meaning 
that if f{YY*) = 0, then Y = 0] and tracial, meaning that f{XY) = <p(YX) 
for ail X, Y G even though in general XY ^ YX . Such a (p is called a trace 
or tracial state. Both of the above examples (bounded random variables and 
bounded random matrices) satisfy these conditions. A von Neumann algebra 
equipped with a tracial state is typically called a (tracial) W* -probability 
space. Some of the theorems in this paper require the extra structure of a Im- 
probability space, while others hold in a general abstract noncommutative 
probability space. To be safe, we generally assume the VF*-setting in what 
follows. Though we do not explicitly specify traciality in the proceeding, we 
will always assume tp is a trace. 

In a W* -probability space, we refer to the self-adjoint elements of the 
algebra as random variables. Any random variable has a law or distribu- 
tion defined as follows: the law of X G is the unique Borel probability 
measure fix on K with the same moments as X; that is, such that 



The existence and uniqueness of [ix follow from the positivity of ip\ see [21], 
Propositions 3.13. Thus, in general, noncommutative probability, the method 
of moments and cumulants plays a central role. 
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In this general setting, the notion of independence of events is harder to 
pin down. Voiculescu introduced a general noncommutative notion of inde- 
pendence in [37] which has, of late, been very important both in operator 
algebras and in random matrix theory. Let si\ , . . . , si n be unital subalge- 
bras of si. Let X\, . . . ,X m be elements chosen from among the si^s such 
that, for 1 < j < m, Xj and Xj+\ do not come from the same si%, and 
such that <f(Xj) = for each j. The subalgebras sii, . . . ,si n are said to 
be free or freely independent if, in this circumstance, (p(X\X2 ■ ■ ■ X n ) = 0. 
Random variables are called freely independent if the unital algebras they 
generate are freely independent. By centering moments it is easy to check 
that, in the case that all the indices are distinct, this is the same as classi- 
cal independence expressed in terms of moments. For example, if X, Y are 
freely independent they satisfy (f[(X n - (p(X n ))(Y m - (f(Y m ))} = 0, which 
reduces to ip{X n Y m ) = (p(X n )ip(Y m ). But if there are repetitions of indices 
among the (generally noncommutative) random variables, freeness is much 
more complicated than classical independence; for example, if X, Y are free, 
then ip(XYXY) = (p(X 2 )(p(Y) 2 + (p(X) 2 tp(Y 2 ) - ip{X) 2 <p(Y) 2 . Nevertheless, 
if X, Y are freely independent, then their joint moments are determined 
by the moments of X and Y separately. Indeed, the law of the random 
variable X + Y is determined by (and can be calculated using the Stieltjes 
transforms of) the laws of X and Y separately. It was later discovered by 
Voiculescu [38] and others that pairs of random matrices with independent 
entries are (asymptotically) freely independent in terms of expected trace; 
this has led to powerful new tools for analyzing the density of eigenvalues 
of random matrices. 

The notion of conditioning is also available in free probability. 

Definition 1.13. Let (si, (p) be a ^-probability space, and let 98 C si 
be a unital VF*-subalgebra. There is a conditional expectation map ip[-\33] 
from £/ onto 3$. It is characterized by the property 

(1.3) <f[XY] = (p[X(p[Y\3g\] for all X G BS, Y £ jz?. 

Conditional expectation has the following properties: 

(1) <p\\9§\ is weakly continuous and completely positive; 

(2) ip[-\B$\ is a contraction (in operator norm) and preserves the identity; 

(3) If Y e si and X, Z G 38, then ip[XYZ\3g] = Xip[Y\3g]Z. 

If X E then we denote by v 3 !'!^] the conditional expectation onto the 
unital von Neumann subalgebra of si generated by X. 

Such conditional expectations were introduced in [35] [where properties 
(l)-(3) were proved]. As one should expect, if X and Y are free, then y?[y|X] = 
<p(Y), as in the classical case. Many analogs of classical probabilistic con- 
structions (such as martingales) are well-defined in free probability, using 
Definition 1.13. See, for example, [6] for a discussion of free Levy processes. 
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1.2. Free Brownian motion. The (centred) semicirclular distribution (or 
Wigner law) S(0, t) is the probability distribution 

(1.4) S(0,t)(dx) = — \ftt-x 2 dx, \x\<2y/i. 

2irt 

Since this distribution is symmetric about 0, its odd moments are all 0. Sim- 
ple calculation shows that the even moments are given by (scaled) Catalan 
numbers: for nonnegative integers m, 

[■2-Si 

/ x 2m S(0,t)(dx) =C m t m , 

where C m = ( 2 ^) . In particular, the second moment (and variance) is t 
while the fourth moment is 2t 2 . 

A free Brownian motion S = (St)t>o is a noncommutative stochastic pro- 
cess; it is a one-parameter family of self-adjoint operators St in a Im- 
probability space {sf,ip), with the following defining characteristics: 

(0) S = 0; 

(1) For < t\ < ti < oo, the law of St 2 — St 1 is the semicircular distribu- 
tion of variance ti — t±; 

(2) For all n and < t± < t2 < • • • < t n < oo, the increments S^ , St 2 — St x , 
St 3 — St 2 , ■ ■ ■ , St„ — St n _ 1 are freely independent. 

The freeness of increments can also be expressed by saying that St 2 — St 1 
is free from St ± whenever ti > t\ > 0; here St is the von Neumann algebra 
generated by {S s : < s < i}. In particular, it follows easily that ip[St 2 |<Sti] = 
St x for t2 > t\ > 0, so free Brownian motion is a martingale. 

There are at least two good ways to construct a free Brownian motion S. 
The first involves the free (Boltzman) Fock space &o($)) constructed on 
a Hilbert space S): J^q(Sj) = ©.^Lo^™ wnere the direct-sum and tensor 
products are Hilbert space operations, and is defined to be a one- 
dimensional complex space with a distinguished unit basis vector called the 
vacuum f2 (not to be confused with the state space of a probability space). 
Given any vector h£ fj, the creation operator a^(h) on ^o(Sj) is defined by 
left tensor-product with h: a) (h)ijj = h ® ift . Its adjoint a(h) is the annihila- 
tion operator, whose action on an n-tensor is given by a(h)h\ ® • • • ® h n = 
(h,hi)h2 <S> • ■ • ® h n [and a(h)il = 0]. The creation and annihilation opera- 
tors are thus raising and lowering operators. Their sum X(h) = a)(h) -\-a(h) 
is a self-adjoint operator known as the field operator in the direction h. 
Let S(f)) denote the von Neumann algebra generated by {X(h);h £ f)}, 
a (small) subset of all bounded operators on the Fock space &o(S)). The 
vacuum expectation state <p(Y) = (YQ,Q)jF Q rfy is a tracial state on <S (.£)). 
Now, take the special case Sj = L 2 (R + ); then S t = X(t m ) is a free Brown- 
ian motion with respect to (S($j),(p). 
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Remark 1.14. This construction of Brownian motion can also be done 
in the classical case, replacing the free Fock space with the symmetric 
(Bosonic) Fock space; for this line of thought see [30]. Although it is abstract, 
it is directly related to concrete constructions in the Wigner, and Wiener, 
chaos. Note: when fj = L 2 (M + ), fj® n may be identified with L 2 (]R™ ), and it 
is these kernels we will work with throughout most of this paper. 

A second, more appealing (if less direct) construction of free Brownian 
motion uses random matrices. Let Wf be a d x d complex Hermitian ma- 
trix all of whose entries above the main diagonal are independent complex 
standard Brownian motions. Set Sf = d~ l / 2 W d . Then the "limit as d — > oo" 
of Sf is a free Brownian motion. This limit property holds in the sense of mo- 
ments, as follows: equip the algebra S d generated by {Sf;t E M + } with the 
tracial state ip d = ^ETr. Then if P = P(Xi,X 2 , ...,X k ) is any polynomial 
in A: noncommuting indeterminates, and t\, . . . ,tk € K+, then 

lim <p d [P(Si ,...,S? k )] = <p[P(S tl ,..., S tk )] , 

where S = (St)t>o is a free Brownian motion. So, at least in terms of mo- 
ments, we may think of free Brownian motion as "infinite-dimensional matrix- 
valued Brownian motion." 

Remark 1.15. The algebra S d of random matrices described above is 
not a von Neumann algebra in the standard sense, since its elements do not 
have finite matrix norms in the standard sup metric. The Gaussian tails of 
the entries guarantee, however, that mixed matrix moments of all orders are 
finite, which is all that is needed to make sense of the standard notion of 
convergence in noncommutative probability theory. 

1.3. The Wigner integral. In this section we largely follow [8]; related dis- 
cussions and extensions can be found in [2-4]. Taking a note from Wiener 
and Ito, we define a stochastic integral associated with free Brownian mo- 
tion in the usual manner. Let S be a free Brownian motion, and let / £ 
L 2 (R") be an off-diagonal rectangular indicator function, taking the form 
/ = 1 [si,ti]x-x[s„ ! t n ] I where the intervals [si.tj, [s n ,t n ] are pairwise 
disjoint. The Wigner integral /„(/) is defined to be the product operator 
In(f) — (<S<i — <Ssi) " • (St n — S Sn ). Extend linearly over the set of all 
off-diagonal step- functions, which is dense in L 2 (R"). The freeness of the 
increments of S yield the simple Wigner isometry 

(1-5) M(siriZ(f)] = {f,9)v<Ry- 

In other words, is an isometry from the space of off-diagonal step func- 
tions into the Hilbert space of operators generated by the free Brownian mo- 
tion 5, equipped with the inner product (X,Y)u> = ip\Y*X], This means I„ 
extends to an isometry from the closure, which is the full space L 2 (M"), 
thus fully defining the Wigner integral. If / is any function in L 2 (M"), we 
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may write 

In(f) = j f(h,...,t n )ds tl ---ds tn . 

This stands in contrast to the classical Gaussian Wiener integral, which we 
shall denote 1^ : 

C(f) = I f(h,...,t n )dw tl ---dw tn . 

Remark 1.16. This construction long post-dates Wigner's work. The 
terminology was invented in [8] as a humorous nod to the fact that Wigner's 
semicircular law plays the Central Limit role here, and the similarity between 
the names Wigner and Wiener. 

Remark 1.17. This is the same as Ito's construction of the multiple 
Wiener integral in classical Wiener-Ito chaos. Note, however, that the incre- 
ments Sft — S Sl , St n — S Sn do not commute. Hence, unlike for the Wigner 
integral, permuting the variables of / generally changes the value of 

The image of the n-fold Wigner integral 1^ on all of L 2 (W^_ ) is called the 
nth order of Wigner chaos or free chaos. It is easy to calculate that different 
orders of chaos are orthogonal from one another (in terms of the trace inner 
product); this also follows from contraction and product formulas below. 
The noncommutative L 2 -space generated by (St)t>o is the orthogonal sum 
of the orders of Wigner chaos; this is the free analog of the Wiener chaos 
decomposition. 

Remark 1.18. The first Wigner chaos, the image of if, is a centred 
semicircular family in the sense of [21], Definition 8.15, exactly as the first 
Wigner chaos is a centred Gaussian family. In particular, In the first order 
of Wigner chaos, the law of any random variable is semicircular S(0,t) for 
some variance t > 0. 

We are generally interested only in self-adjoint elements of a given order 
of chaos. Taking note of Remark 1.17, we have 



In(J)*=(J f(h,...,t n )dS tl ---dS t , 




where /*(£i, ■ • ■ ,t n ) = f(t n , . . . ,ti). This prompts a definition. 



Definition 1.19. Let n be a natural number, and let / be a function 
in L 2 (M™). 
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(1) The adjoint of / is the function f*(ti,..., t n ) = f(t n , . . . , t\). 

(2) / is called mirror symmetric if / = /*; that is, if f(t±,...,t n ) = 
f(t n , . . . ,t±) for almost all t%, . . . ,t n > with respect to the product Lebesgue 
measure. 

(3) / is called fully symmetric if it is real-valued and, for any permu- 
tation a in the symmetric group S n , f(ti, . . . ,t n ) = /(i CT (i), . . . ,i<r(n)) for 
almost all t\, . . . ,t n > with respect to the product Lebesge measure. 

Thus an element In(f) of the nth Wigner chaos is self adjoint iff / is 
mirror symmetric. Note, in the classical Gaussian Wiener chaos, it is typical 
to consider only kernels that are fully symmetric, since if / is constructed 
from / by permuting its arguments, then I^(f) = /„(/). This relation does 
not hold for 1°. 

Remark 1.20. The calculation in equation (1.6) may seem nonrigorous. 
A more pedantic writing would do the calculation first for an off-diagonal 
rectangular indicator function / = ^[ Sl .t 1 ]x---x[s n ,t n ]i m which case the adjoint 
is merely [(S tl -S S1 )--- (S tn - S Sn )]* = (S tn - S Sn ) • • • (S tl - S Sl ) since S t is 
self adjoint; extending (sesqui) linearly and completing yields the full result. 
This is how statements like (dS^ • • • dSt n )* = dSt n • • • dSt ± should be inter- 
preted throughout this paper. 

Contractions are an important construction in Wigner and Wiener chaos; 
we briefly review them now. 

Definition 1.21. Let n, m be natural numbers, and let / G L 2 (]R™ ) and 
g € L 2 (W£). Let p < min{n,m} be a natural number. The pth contraction 

f ^ g of / and g is the L 2 (R™ +m ~ 2p ) function defined by nested integration 
of the middle p variables in / (g) g 

f ^ 9(tl, ■ ■ ■ , t n +m-2p) = / f(tl,---,t n -p,Sl,...,S p ) 

Jr p + 

x d( s pi ■ • ■ 5 s l) tn—p+li ■ ■ ■ i tn+m—2p) ds\ • • • ds p . 

Notice that when p = 0, there is no integration, just the products of / 
and g with disjoint arguments; in other words, / ^ g = f ' ® g. 

Remark 1.22. It is easy to check that the operation ^ is not generally 
associative. 

Remark 1.23. In [22, 26, 28, 29] as well as standard references like [23, 
24, 27], contractions are usually defined as follows: 

/ ®p fi^l ) • • • ) tn+m— 2p) = I {ti, . . . ,t n —p, Si, . . . , Sp) 

JR P + 

x diin—p+l j • ■ • i tn+m— 2pi s l i • • • i s p) ds\ • • • dSp. 
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Notice that this operation is related to our nested contraction ^ as follows: 

f ®p 9* (^1 5 ■ • ■ ) ^n— pi ^n+m— 2p; ■ • ■ i tn—p+1 ) — / 3(^1 > • ■ • > tn+m— 2p) • 

In other words, up to reordering of variables, the two operations are the 
same. In particular, if /, g are fully symmetric, then / g and / ® p g have 
the same symmetrizations. This will be relevant to Theorem 1.8 below. 

The following lemma records two useful facts about contractions and ad- 
joints; the proof is easy calculation. 

Lemma 1.24. Let n,m be natural numbers, and let f G L 2 (W1) and 
g£L 2 (M.™). 

(1) If p< min{n, m} is a natural number, then (/ ^ g)* = g* ^> /* . 

(2) Ifn = m, then the constant f ^> g satisfies f-^g = g^f = (f, 5'*)l 2 (r™) • 

Contractions provide a useful tool for decomposing products of stochastic 
integrals, in precise analogy to the classical context. The following is [8], 
Proposition 5.3.3. 

Proposition 1.25 (Biane-Speicher) . Let n,m be natural numbers, and 
let f G L 2 (IR™ ) and geL 2 (R™). Then 

min{n,m} 

(1.7) lS{f)-li(9)= £ I* + m-2 P (f-9). 

Remark 1.26. In the Gaussian Wiener chaos, a similar though more 
complicated product formula holds. 

min{n,m} / \ / \ 
tf(f)-I%(9)= E Pl(p)(p)«m-*(/^^ 

It is common for formulas from classical probability to have free probabilistic 
analogs with simpler forms, usually with binomial coefficients removed. This 
can be understood in terms of the relevant (smaller) class of partitions that 
control moments in the theory, as we discuss in Section 1.4 below. 

1.4. Noncrossing partitions. Proposition 1.25 shows that contractions 
are involved in the algebraic structure of the space of stochastic integrals. 
Since contractions involve integrals pairing different classes of indices, gen- 
eral moments of stochastic integrals are best understood in terms of a more 
abstract description of these pairings. For convenience, we write [n] to rep- 
resent the set [n] = {1, 2, . . . , re} for any positive integer n. If re is even, then 
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Fig. 1. Two pairings of [6] ={1,2,3,4,5,6}. The first (totally-nested) pairing is non- 
crossing, while the second is not. 

a pairing or matching of [n] is a partition of [n] into n/2 disjoint subsets 
each of size 2. For example, {{1, 6}, {2, 5}, {3, 4}} and {{1, 2}, {3, 5}, {4, 6}} 
are two pairings of [6] = {1,2,3,4,5,6}. It is convenient to represent such 
pairings graphically, as in Figure 1. 

It will be convenient to allow for more general partitions in the sequel. 
A partition of [n] is (as the name suggests) a collection of mutually disjoint 
nonempty subsets B± , . . . , B r of [n] such that B\ U • ■ • U B r = [n]. The subsets 
are called the blocks of the partition. By convention we order the blocks by 
their least elements; that is, mm Bi < mm Bj iff i < j. The set of all partitions 
on [n] is denoted ^(n), and the subset of all pairings is 

Definition 1.27. Let ir e &(n) be a partition of [n]. We say ir has 
a crossing if there are two distinct blocks B\ , Bi in ir with elements x\,yi € B\ 
and X2,U2 £ B2 such that x\ < x<i < y\ < fj2- (This is demonstrated in Fig- 
ure 1.) 

If 7T € 8?{n) has no crossings, it is said to be a noncrossing partition. 
The set of noncrossing partitions of [n] is denoted NC{n). The subset of 
noncrossing pairings is denoted NC2(n). 

The reader is referred to [21] for an extremely in-depth discussion of 
the algebraic and enumerative properties of the lattices NC{n). For our 
purposes, we present only those structural features that will be needed in 
the analysis of Wigner integrals. 

Definition 1.28. Let ni, . . . , n r be positive integers with n = n\ -) V n r . 

The set [n] is then partitioned accordingly as [n] = B\ U • • • U B r where 
B\ = {1, . . . ,n±}, B2 = {n\ + 1, . . . , n\ + 712}, and so forth through B r = 
{rii -| h n r -i + 1, . . . , ni -I h n r }. Denote this partition as n\ ® • • • <g) n r . 

Say that a pairing it G &2{ n ) respects n\ ® ■ ■ ■ ® n T if no block of tt con- 
tains more than one element from any given block of n\ ® • ■ ■ ® n r . (This 
is demonstrated in Figure 2.) The set of such respectful pairings is denoted 
^2(1^1 <8> • • • <8> n r ). The set of noncrossing pairings that respect n\ <8> • • • <8> n r 
is denoted NC2{n\ ® ■ ■ ■ ® n r ). 

Partitions n\ % ■ ■ ■ % n r as described in Definition 1.28 are called interval 
partitions, since all of their blocks are intervals. Figure 2 gives some examples 
of respectful pairings. 



WIGNER CHAOS AND THE FOURTH MOMENT 



15 



n n 



i i i 



i i i 



n n 



n n 



Fig. 2. The partition 4®3<8>l<g>2®2 is drawn above the dots; below are three pairings 
that respect it. The two bottom pairings are in iVC2(4 <g>3(g>l<g>2<g>2). 



Remark 1.29. The same definition of respectful makes perfect sense for 
more general partitions, but we will not have occasion to use it for anything 
but pairings. However, see Remark 1.32. 

Remark 1.30. Consider the partition n\ n r — { B\ , . . . , B r } , as 

well as a pairing tt G &2 ( n ) , where n = n\ + • • • + n r . In the classical literature 
about Gaussian subordinated random fields (cf. [31], Chapter 4, and the 
references therein) the pair [n\ (g) ■ ■ • ® n r , tt) is represented graphically as 
follows: (i) draw the blocks B±, . . . , B r as superposed rows of dots (the ith 
row containing exactly rii dots, i = l,...,r), and (ii) join two dots with 
an edge if and only if the corresponding two elements constitute a block 
of tt. The graph thus obtained is customarily called a Gaussian diagram. 
Moreover, if tt respects ni (g> • • ■ ® n r according to Definition 1.28, then the 
Gaussian diagram is said to be nonflat, in the sense that all its edges join 
different horizontal lines, and therefore are not flat, that is, not horizontal. 
The noncrossing condition is difficult to discern from the Gaussian diagram 
representation, which is why we do not use it here; therefore the nonflat 
terminology is less meaningful for us, and we prefer the intuitive notation 
from Definition 1.28. 

One more property of pairings will be necessary in the proceeding analysis. 

Definition 1.31. Let m, . . . ,n r be positive integers, and let tt g ^2(^1® 
• • • ® n r ) . Let B\ , B2 be two blocks in m <g> • • • <g> n r . Say that ir links B\ and 
B2 if there is a block {i, j} G tt such that i G B\ and j G -62- 

Define a graph C n whose vertices are the blocks of n\ ® ■ • ■ ® n r ; C n has 
an edge between B\ and B2 iff tt links B\ and B2. Say that tt is connected 
with respect to n\ ® ■ • ■ (8) n r (or that tt connects the blocks of n\ ® ■ • ■ ® n r ) 
if the graph C n is connected. 

Denote by NC^ini (g> • • • (g> n r ) the set of noncrossing pairings that both 
respect and connect n\ ® ■ ■ ■ ® n r . 
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For example, the second partition in Figure 2 is in iVC|(4 ®3<8>l<X>2(g>2), 
while the third is not. The interested reader may like to check that iVC2(4<8> 
3 <8> 1 <8> 2 <8> 2) has 5 elements, and all are connected except the third example 
in Figure 2. 

Remark 1.32. For a positive integer n, the set NC{n) of noncrossing 
partitions on [n] is a lattice whose partial order is given by reverse refine- 
ment. The top element l n is the partition {{1, . . . ,n}} containing only one 
block; the bottom element n is {{1}, . . . , {n}} consisting of n singletons. 
The conditions of Definitions 1.28 and 1.31 can be described elegantly in 
terms of the lattice operations meet A (i.e., inf) and join V (i.e., sup). If 
n = n\ + • ■ • + n r , then tt G NCzin) respects n\ <g> ■ • ■ ® n r if and only if 
vr A (ni Cg) • • • <g) n r ) = n ; tt connects the blocks of n\ ® ■ • ■ <g) n r if and only if 
tt V (n\ (§) ■ ■ ■ <g) n r ) = l n . 

Remark 1.33. Given n\,...,n r and a respectful noncrossing pairing 
tt G NC2{n\ ® ■ ■ ■ ® n r ), there is a unique decomposition of the full index 
set [n], where n = n\ + ■ • ■ + n r , into subsets Di, . . . , D m of the blocks of 
n\ ® • • • ® n r , such that the restriction of tt to each connects the blocks 
of Di. These Di are the vertices of the graph CV grouped according to 
connected components of the graph. For example, in the third pairing in 
Figure 2, the decomposition has two components, D\ = 4® 3® 1 and D2 = 
2 (8) 2. To be clear, this notation is slightly misleading since the 2 ® 2 in this 
case represents indices {9, 10}, {11, 12}, not {1,2}, {3, 4}; we will be a little 
sloppy about this to make the following much more readable. 

There is a close connection between respectful noncrossing pairings and 
expectations of products of Wigner integrals. To see this, we first introduce 
an action of pairings on functions. 

Definition 1.34. Let n be an even integer, and let tt G ^(n). Let 
— > C be measurable. The pairing integral of / with respect to tt, de- 
noted J f, is defined (when it exists) to be the constant 



For example, given the second pairing tt = {{1, 2}, {3, 5}, {4, 6}} in Fig- 
ure 1, 



Remark 1.35. The operation J is not well defined on L 2 (M"); for ex- 
ample, if n = 2 and tt = {{1,2}}, then f n f is finite if and only if / is the 
kernel of a trace class Hilbert-Schmidt operator on L 2 (M + ). However, it is 





f(r, r, s, t, s, t) dr ds dt. 
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n m 




Fig. 3. A partial pairing t v of [n + m] corresponding to a p-contraction; here n — 6, 
m = 7, and p = 4. 

easy to see that J f is well-defined whenever / is a tensor product of func- 
tions, and 7r respects the interval partition induced by this tensor product 
(cf. Lemma 2.1). (This is one of the reasons why one should interpret mul- 
tiple stochastic integrals as integrals on product spaces without diagonals, 
since integrals on diagonals are in general not defined.) This is precisely the 
case we will deal with in all of the following. 

Note that a contraction f g can be interpreted in terms of a pairing 
integral, using a partial pairing, that is, one that pairs only a subset of 
the indices. If / 6 L 2 (M" ) and g £ L 2 (R™), and p < min{n, m} is a natural 
number, then 

fAg= f f®g, 

where t p is the partial pairing {{n, n + 1}, {n — 1, n + 2}, . . . , {n— p+ 1, n+p}} 
of [n + m] . 

The partial contraction pairings r p provide a useful decomposition of the 
set of all respectful noncrossing pairings, in the following sense. Let m,...,n r 
be positive integers. If p < min{ni,n2}, the partial pairing t p acts (on the 
left) on the partition n\ ® rii <g> n% <g> • • • <8> n T to produce the partition [n\ + 
ri2 — 2p) <8> ^3 <8> • • • <8> n r . That is, t p joins the first two blocks of n\ ® ■ ■ ■ (8> n r 
and deletes the paired indices to produce a new interval partition. This is 
demonstrated in Figure 4. 

Considered as such a function, we may then compose partial contraction 
pairings. For example, following Figure 4, we may act again with T\ on 5 ® 

rm m i n n ^ rrm i n n 

1 2 3 45 6 78 9101112 1 2 3 6 7 8 9101112 

I I 

Fig. 4. The partial pairing n acts on the left on 4cg>3Cg>l<g)2® 2, joining the first two 
blocks and deleting the middle indices, to produce the partition 5 <g 1 <8> 2 ® 2. 27ie indices 
are labeled to make the action clearer. 
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1 2 3 4 5 6 7 



10 11 12 



1 2 3 6 7 8 9 10 11 12 



n 



T) 



n n 



1 2 3 6 9 10 11 12 



n n 

1 2 11 12 
• • • • 



T-2 



rm m i n n 

1 2 3456 789 10 11 12 

* * i Li i i-i i i ' ' 



t 2 o r 2 o n o n 

Fig. 5. The composition T2 o T2 o t\ o ti produces a noncrossing pairing that respects 
4 ® 3 <g>l <g> 2® 2. 

1 (g) 2 ® 2 to yield 4 ® 2 ® 2; then with T2 to get 2 <g) 2; and finally T2 maps this 
partition to the empty partition. Stringing these together gives a respectful 
pairing of the original interval partition, which we denote T2 o ti ot\ ot\. 
Figure 5 displays this composition. 

To be clear: we start from the left and then do the partial pairing t p 
between the first and second block; after this application, the (rest of the) 
first and second blocks are treated as a single block. This is still the case 
if p = 0; here there are no paired indices, but the action of To records the 
fact that, for further discussion, the first two blocks are now connected. An 
example is given in Figure 6 below, where the action of to is graphically 
represented by a dashed line. 

With this convention, further r p may act only on the first two blocks, 
which results in a unique decomposition of any respectful pairing into partial 
contractions, as the next lemma makes clear. 

Lemma 1.36. Let rii,...,n r be positive integers, and let tt £ NC2(n\ ® 
•••<S>n r ). There is a unique sequence of partial contractions r Pl , . . . , r Pr _ 1 
such that tt = T Pr _ 1 o • • • o t Pi . 

Proof. Any noncrossing pairing must contain an interval {i,i + 1}; 
cf. [21], Remark 9.2(2). Hence, since tt respects n\ ® ■ ■ ■ ®n r = {Bi, . . . ,B r }, 
there must be two adjacent blocks linked by tt. Let j £ [k] be the small- 
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n n 



n 



T2 



n n 



T 3 O T O T 2 



Fig. 6. The pairing tt = {{1, 10}, {2, 5}, {3, 4}, {6, 9}, {7, 8}} respects the interval parti- 
tion 3 g) 2 ® 2 ® 3. fe decomposition is given by n = T3 o to o T2. 



est index for which Bj,Bj + ± are connected by 7r; hence all of the blocks 
B\,...,Bj pair among the blocks Bj+i, . . . ,B r . Note that any partition 
that satisfies this constraint and also respects the coarser interval parti- 
tion (m H h nj) <g) nj+i ® • • • ® n r is automatically in ./VC2 (ni (8) • • • (g) n r ) . 

In other words, we can begin by decomposing tt = tt 1 o (to) j— , where 7r' £ 
NC-2{(n\ + ■ ■ ■ + rij) n-j+i ® • ■ • ® n r ) links the first and second blocks of 
this interval partition. By construction, this j is unique. 

Let no = ni H 1- nj, so 7r' links {1, . . . , no} with {no + 1, . . . , no + ^j+i}- 

It follows that {no, no + 1} € vr': for if no pairs with some element no + i with 
i > 2, then no + 1, . . . , no + i — 1 cannot pair anywhere without introducing 
crossings. Following these lines, an easy induction shows that there is some 
p G [min{no, %"+i}] such that the pairs {no, no + 1}, {no — 1, no + 2}, ... , {no — 
p + l,no +p} are in it', while all indices 1, . . . , no — p and no +p + 1, . . . ,no + 
nj + i pair outside [no + Tij+i] ■ In other words, ir' = tt" o r p for some nonpross- 
ing pairing it" that respects (no — p) ® (n J+ i — p) ® 713 ® • • • (8) n r . What's 
more, since p was chosen maximally so that there are no further pairings in 
the blocks (no — p) <8> (%+i — p), these two may be treated as a single block, 
and tt" is only constrained to be in iVC^no + n^+i — 2p,n3, . . . ,n r ). Since 
p > 0, the lemma now follows by a finite induction; uniqueness results from 
the left-most choice of j and maximal choice of p at each stage. □ 

By carefully tracking the proof of Lemma 1.36, we can give a complete 
description of the class of respectful pairings in terms of their decomposi- 
tions. 

Lemma 1.37. Let ni,...,n r be positive integers. The class iVC^ni <g> 
• • -(S>n r ) is equal to the set of compositions Tp^o- • -ot Pi where (p\, . . . ,p r -i) 
satisfy the inequalities 

< pi < min{n2, ni}, 

(1.8) < p k < min{n fc+ i, ni H h n k - 2p 1 2p fc _i}, 

1 < k < r — l,2(pi H hPr-i) = ni H h n r . 
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Inequalities (1.8) in Lemma 1.37 successively guarantee that the par- 
tial contractions T Pk in the decomposition of it only contract elements from 
within two adjacent blocks; the final equality is to guarantee that all indices 
are paired in the end. Since every respectful pairing has a contraction decom- 
position, and each contraction decomposition satisfying inequalities (1.8) is 
respectful (a fact which follows from an easy induction), these inequalities 
define iVC^ni ® • • • <S>n r ). This completely combinatorial description would 
be the starting point for an enumeration of the class of respectful pairings; 
however, even in the case m = • • • = n r , the enumeration appears to be ex- 
tremely difficult. 

We conclude this section with a proposition that demonstrates the effi- 
cacy of pairing integrals and noncrossing pairings in the analysis of Wigner 
integrals. 

Proposition 1.38. Letn±, ...,n r be positive integers, and suppose /i,..., 
f r are functions with /j G L 2 (R™') for 1 < i < r. The expectation ip of the 

product of Wigner integrals I~ (/i) • • • 1^ (f r ) is given by 

(1.9) (/!)•••/£(/.•)]= E //l®-"®/r. 

tt£NC 2 (ni®-®nr) 7r 

Remark 1.39. This result has been used in the literature (e.g., to 
prove [8], Theorem 5.3.4), but it appears to have a folklore status in that 
a proof has not been written down. The following proof is an easy application 
of Proposition 1.25, together with Lemma 1.37. 

Proof. By iterating equation (1.7), we arrive at the following unwieldy 
expression. (For readability, we have hidden the explicit dependence of the 
Wigner integral i~ on the number of variables n in its argument.) 

(1.10) I S (fx) • • • I S (f r ) = £•••£ /*((• • • ((/l * h) * h) ■ ■ •) ^ fr), 

Pr-1 Pi 

where pi, ■ ■ ■ ,p r -i range over the set specified by the first two inequalities 
in equation (1.8). (This is the range of the pt for the same reason that 
those inequalities specify the range of the pk for contraction decompositions: 
the first two inequalities in (1.8) merely guarantee that contractions are 
performed, successively, only between two adjacent blocks of n\ ® ■ ■ ■ ® n r .) 
Note: following Remark 1.22, the order the contractions are performed in 
equation (1.10) is important. 

Taking expectation in equation (1.10), note that most terms have (p = 
since any nontrivial stochastic integral is centred (as it is orthogonal to 
constants in the 0th order of chaos). Hence, the only terms that contribute 
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to the sum are those for which the iterated contractions pair all indices of 
the functions; that is, the sum is over those pi, ■ ■ ■ ,p r -i for which 2(pi + 

\-p r -i) = n\ H \-n r , so that the stochastic integral I s in the sum is 1$ . 

Since such a trivial stochastic integral is just the identity on the constant 
function inside, this shows that 

VlI S (h) ■ ■ ■ I" (fr)} = £■•■ £((" • • ((/l * / 2 ) » h) ■ ■ •) ^ fr), 

Pr-1 Pi 

where the sum is over those pi, ■ ■ ■ ,p r -i satisfying the same inequalities men- 
tioned above, along with the condition 2(p% -\ + p r -i) = n\ + • ■ ■ + n r ; 

that is, the pk satisfy inequalities (1.8). Each such iterated contraction inte- 
gral corresponds to a pairing integral of f\ ® • • • ® f r m the obvious fashion, 

((■•■((/l-/ 2 )-/ 3 )"-) P -fr)= [ /l®-®/ r . 

J-rp r _ 1 o---OT Pl 

Lemma 1.37 therefore completes the proof. □ 

Remark 1.40. Another proof of Proposition 1.38 can be achieved using 
a random matrix approximation to the free Brownian motion, as discussed 
in Section 1.2. The starting point is the classical counterpoint to Proposi- 
tion 1.38 [17], Theorem 7.33, which states that the expectation of a product 
of Wiener integrals is a similar sum of pairing integrals over respectful (i.e., 
nonflat) pairings, but in this case crossing pairings must also be included. 
Modifying this formula for matrix-valued Brownian motion, and control- 
ling the leading terms in the limit as matrix size tends to infinity using the 
so-called "genus expansion," leads to equation (1.9). The (quite involved) 
details are left to the interested reader. 

2. Central limit theorems. We begin by proving Theorem 1.6, which we 
restate here for convenience. 

Theorem 1.6. Let n be a natural number, and let (fk)keN be a sequence 
of functions in L 2 (M™ ), each with \\fk\\L 2 (M. n ) = 1- ^ e following statements 
are equivalent: 

(1) The fourth absolute moments of the stochastic integrals I^ifk) con- 
verge to 2. 

lim^(|4 s (/ fc )| 4 ) = 2. 

(2) All nontrivial contractions of converge to 0. For each p = 1, 2, . . . , 
n — 1, 

lim f k ^f k =0 mi 2 (E^- 2p ). 

PROOF. The expression |/^(/fc)| 4 is short-hand for [In(fk) ■Inifk)*] 2 - 
Since [according to equation (1.6)] In(fk)* = In (/£)> this is a product of 
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Wigner integrals, to which we will apply Proposition 1.25. First, 

n 

(2.1) I S n {f k ) ■ I S n {ft) = E^n-2 P (A * ft). 

p=0 

The Wigner integrals on the right-hand side of equation (2.1) are in differ- 
ent orders of chaos, and hence are orthogonal (with respect to the c^-inner 
product). Thus, we can expand 

= {i^h)-i s n {r k )j s n {fk)-i s n {r k )) ip 

n 

= y^(-^2w-2ip(/fc ^ fk)^2n-2p(fk ^ fk))<Pi 
p=0 

where in the second equality we have used the fact that In(fk) ' ^n(fk) * s se ^ 
adjoint. Now employing the Wigner isometry [equation (1.5)], this yields 

n 

(2.2) <p(\i%(f k )\ 4 ) = £</* * r k ,fk a r k ) L2{Kn -2 Py 

Consider first the two boundary terms in the sum in equation (2.2). When 
p = n, we have 

fk /2 - /fc = (fk,fk)L 2 (W±) = 1) 

according to Lemma 1.24(2) and the assumption that fk is normalized in I? . 

On the other hand, when p = 0, the contraction f k ^ f k is just the tensor 
product / <g> /* , and we have 

(fk ® f k ,f k ® /fc)L2( R 2n) = (/fc,/fc)L 2 (R™)(/fc 5 /fc)L 2 (R™) = 1- 

(Both terms in the product are equal to ||/fc||^ 2 = 1 5 following Definition 1.19 
of /£.) Equation (2.2) can therefore be rewritten as 

n-l 

(2.3) v{\I S n {fkt) = 2 + £ ||/ fc A / fe *||2 2(R2 „_ 2p) . 

P =i + 

Thus, the statement that the limit of (^(|I^ (/&)| 4 ) equals 2 is equivalent 
to the statement that the limit of the sum on the right-hand side of equa- 
tion (2.3) is 0. This is a sum of nonnegative terms, and so each of the terms 
must have limit 0. This completes the proof. □ 

Corollary 1.7 now follows quite easily. 

Corollary 1.7. Let n>2 be an integer, and consider a nonzero mir- 
ror symmetric function f £ L 2 (M"). Then the Wigner integral L^(f) satis- 
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fies ip[I^(f) 4 ] > 2ip[I^(f) 2 ] 2 . In particular, the distribution of the Wigner 
integral I^if) cannot be semicircular. 

PROOF. By rescaling, we may assume that ||/||£2(r»») = 1; in this case, 

equation (2.3) shows that ip[I^ (/) 4 ] > 2ip[I^(f) 2 ] 2 . To achieve a contradic- 
tion, we assume that (p[I^(f) A ] = 2ip[I^(f) 2 ] 2 = 2 [which would be the case 
if 1^ if) were semicircular]. Then the constant sequence fj. = f for all k 
satisfies condition (1) of Theorem 1.6; hence, for 1 < p < n — 1, 

/ A /* = l im f k A /* = in L 2 (R 2 + n - 2p ). 

Take, for example, p = n — 1. Let g G L 2 (R + ), so that g®g* G L 2 (M^_). Then 
we may calculate the inner product 

(/^ f*,9®9*) L 2(M.l) 

= Jif n ^ 1 r}(s,t)[g®g*}( S ,t)dsdt 

J f(s, s 2 , ■ ■ ■ , s n )f*(s n , ...,s 2 ,t)ds 2 --- ds n ^J g(s)g(t) ds dt 

= j g*(s)f(s,s 2 ,..., s n ) ■ g* (t)f(t, s 2 ,...,s n )ds dt ds 2 ■ ■ ■ ds n 

= lis ^ /IIl2 (k ™-1)- 

By assumption, / n <~^ f* = 0, and so we have g* / = for all g G L 2 (M + ). 
That is, for almost all S2, • ■ • , s n G M + , 

/ 3(s)/(s,s 2) ...,%)ds = 0. 

J 

For fixed S2 5 ■ • ■ > s n fo r which this holds, taking g to be the function g(s) = 
f(s, s 2 , . . . , s n ) yields that f(s, s 2 , . . . , s n ) = for almost all s. Hence, / = 
almost surely. This contradicts the normalization ||/||x,2(R n ) = 1- D 

We now proceed towards the proof of Theorem 1.3. First, we state a tech- 
nical result that will be of use. 

Lemma 2.1. Let ni,...,n r be positive integers, and let fi G L 2 (R™ 1 ) for 
1 < i < r. Let ir be a pairing in & 2 {n\ ® ■ ■ ■ <g> n r ). Then 

< II/i|Il 2 (E"1) • • • \\fr\\L 2 {Rl r )- 

Proof. This follows by iterated application of the Cauchy-Schwarz in- 
equality along the pairs in tt. It is proved as [17], Lemma 7.31. □ 





fl ® • ■ • ® fr 
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The following proposition shows that contractions control all important 
pairing integrals. 

Proposition 2.2. Let n be a positive integer. Consider a sequence (fk)kem 
with fk £ L 2 (W\_) for all k, such that: 

(1) fk = ft for all k; 

(2) there is a constant M > such that \\fk\\L 2 (R^_) — M for all k; 

(3) for each p = 1,2, ... ,n — 1, 

P rt r. „•_ T 2/m2n-2p\ 



lim f k — ft = in L 



Let r > 3, and let it be a connected noncrossing pairing that respects 
vr e NC%{n® r ); cf. Definitions 1.28 and 1.31. Then 



lim 

k— >oo 



Jk 



0. 



Proof. Begin by decomposing tt = r Pr _ 1 o- ■ ■ or pi following Lemma 1.36. 
There must be some nonzero pf, to simplify notation, we assume that p\ > 0. 
(Otherwise we may perform a cyclic rotation and relabel indices from the 
start.) Note also that, since ir connects the blocks of n® r and r > 2, it follows 
that p\ < n: else the first two blocks {1, . . . , n} and {n + 1, . . . , 2n} would 
form a connected component in the graph C n from Definition 1.31, so C n 
would not be connected. Set tt 1 = r Pk o • • • o t P2 , so that tt = tt' o t Pi . Then (as 
in the proof of Proposition 1.38) it follows that 



(2.4) 



/ /r= / Uk p -fk)®ft {r - 2) - 

J TV J TV' 



To make this clear, an example is given in Figure 7, with the corresponding 
iterations of the integral in equation (2.5). 

f(ti,t2,t 3 )f(t 3 ,t 2 ,t4)f(t 4 , t 5 ,t e )f(t 6 , t 5 , h) dti dt 2 dt 3 dt A dt 5 dt 6 



y®4 

2.5) 



(/ f)(ti,U)f(U, k, t 6 )f(t 6 , t 5 ,h)dti dt 4 dt 5 dt 6 



(/-/)®r 2 . 

Employing Lemma 2.1, we therefore have 



f t 
Jk 



{fk~fk)®fl 



•(r-2) 



(2.6) 



<||/fc-/fc 



Pi t II \\f \\r-2 

L2( R 2"-2 P) ■ \\}k\\ L %^n\ 



< Wfk ^ /fc|| i 2( R 2n-2p^ 



\\fk\\ r L 

M r ~ 2 , 
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n 



7T = 7T o T-2 



Fig. 7. A pairing n £ 7VC2(3 ), with the first step in its contraction decomposition (per 
Lemma 1.36). 



using assumption (2) in the proposition. But from assumptions (1) and (3), 
||/fc ^ /fc|| L 2( K 2n-2 P ^ — > 0. The result follows. □ 

We can now prove the main theorem of the paper, Theorem 1.3, which 
we restate here for convenience. 

Theorem 1.3. Let n>2 be an integer, and let (fk)keN be a sequence 
of mirror symmetric functions in L 2 (M"), each with \\fk\\L 2 (iSL n ) = 1- The 
following statements are equivalent: 

(1) The fourth moments of the stochastic integrals I^ifk) converge to 2. 

lim ^{I s n (f k f) = 2. 

fc— >oo 

(2) The random variables Inifk) converge in law to the standard semi- 
circular distribution S(0, 1) as k — > oo. 

Proof. As pointed out in Remark 1.5, the implication (2) (1) is 
essentially elementary: we need only demonstrate uniform tail estimates. In 
fact, the laws ji^ of In(fk) are all uniformly compactly-supported: by [8], 
Theorem 5.3.4 (which is a version of the Haagerup inequality, cf. [15]), any 
Wigner integral satisfies 

||^(/)||<(n + l)||/|| i2(K?) . 

Since all the functions fk are normalized in L 2 , it follows that supp/i^ C 
[—n — l,n + l] for all k. Since the semicircle law is also supported in this 
interval, we may approximate the function x i— > x 4 by a C C (R) function that 
agrees with it on all the supports, and hence convergence in distribution 
of Hk to the semicircle law implies convergence of the fourth moments by 
definition. 

We will use Proposition 2.2, together with Proposition 1.38, to prove the 
remarkable reverse implication. Since S(0, 1) is compactly supported, it is 
enough to verify that the moments of I^ifk) converge to the moments of 
S(0, 1), as described following equation (1.4). Since In(fk) is orthogonal to 
the constant 1 in the first order of chaos, In(fk) is centred; the Wigner isom- 
etry of equation (1.5) yields that the second moment of Inifk) is constantly 1 
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due to normalization. Therefore, take r > 3. Proposition 1.38 yields that 

(2-7) VV'UkT] = E / 

7re7VC* 2 (n«"-;r 7r 

Following Remark 1.33, any tt G NC2{n® r ) can be (uniquely) decomposed 
into a disjoint union of connected pairings 7r = tt\ U • • • U TT m with 7Tj £ 
A^C^n®^) for some rj's with n + • • • + r m = r. Since the decomposition 
respects the partition n® r , the pairing integrals decompose as products. 

„ m ~ 

(2.8) / /r=n/ /r- 

Assumption (1) in this theorem implies, by Theorem 1.6, that /& ^ ft — > 
in L 2 for each p £ {1, . . . ,n — 1}. Therefore, from Proposition 2.2, it follows 
that for each of the decomposed connected pairings TTi with ri > 3, the cor- 
responding pairing integral J" f® r * converges to in L 2 . Since the number 
of factors m in the product is bounded above by r (which does not grow 
with k), this demonstrates that equation (2.7) really expresses the limiting 
rth moment as a sum over a small subset of iVC^Ti® 7- ). Let ./VC|(n® r ) de- 
note the set of those respectful pairings tt such that, in the decomposition 
tt = tv i U • • • U 7r m , each rj = 2, in other words, such that the connected com- 
ponents of the graph CV each have two vertices. Thus we have shown that 

(2-9) lim v[I S nUk) T ] = £ lim / /« 

n£NC$(n®r) ■ Jn 

Note: if each rj = 2 and r = r\ + ■ • • + r m , then r = 2m is even. In other 
words, if r is odd, then NC^in® 1 ") is empty, and we have proved that all 
limiting odd moments of Inifk) ar e 0. If r = 2m is even, on the other hand, 
then the factors 7Tj in the decomposition of tt can each be thought of as 
TTi G NC2(n® n). The reader may readily check that the only noncrossing 
pairing that respects n<8> n is the totally nested pairing 7Tj = {{n, n + 1}, {n — 
l,n + 2},. ..,{1, 2n}} in Figure 1. Thus, utilizing the mirror symmetry of fk, 



fk®fk= A®/fc = ||/fc|li a(R n ) =l. 

Therefore, equation (2.9) reads 

(2.10) lim v[I S nUk? m ] = £ 1 = |iVC 2 2 K 2m )|. 

In each tensor factor of n® 2m , all edges of each pairing in tt act as one unit 
(since they pair in a uniform nested fashion as described above); this sets 
up a bijection A r C|(n® 2m ) = NC2{2m). The set of noncrossing pairings of 
[2m] is well known to be enumerated by the Catalan number C m (cf. [21], 
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Lemma 8.9), which is the 2mth moment of 5(0,1); see the discussion fol- 
lowing equation (1.4). This completes the proof. □ 

Next we prove the Wigner-Wiener transfer principle, Theorem 1.8, re- 
stated below. 

Theorem 1.8. Let n>2 be an integer, and let (fk)ke~N be a sequence of 
fully symmetric functions in L 2 (W^_). Let a > be a finite constant. Then, 
as k—too: 

(1) Wn (fk) 2 ] ^n\a 2 if and only if ip[I* (f k ) 2 ] -> a 2 ; 

(2) if the asymptotic relations in (1) are verified, then 1^ (fk) converges 
in law to a normal random variable N(0,nla 2 ) if and only ifL^(fk) converges 
in law to a semicircular random variable 5(0, c 2 ). 

Proof. Point (1) is a simple consequence of the Wigner isometry of 
equation (1.5), stating that for fully symmetric / G L 2 (M"), ip[I^(f) 2 ] = 
ll/lll (since / is fully symmetric, / = /* in particular), together with the clas- 
sical Wiener isometry which states that K[L^(f) 2 ] =n!||/|||. For point (2), 
by renormalizing fk we may apply Theorems 1.3 and 1.6 to see that I^(fk) 

converges to 5(0, 1) in law if and only if the contractions fk ^ f% = fk ^ fk 
converge to in L for p = 1, 2, . . . , n — 1. Since / is fully symmetric, these 
nested contractions fk ^ fk are the same as the contractions / (8> p / in [29] 
(cf. Remark 1.23), and the main theorems in that paper show that these 
contractions tend to in L 2 if and only if the Wiener integrals 1^ (fk) 
converge in law to a normal random variable, with variance n\ due to our 
normalization. This completes the proof. □ 

As an application, we prove a free analog of the Breuer-Major theorem 
for stationary vectors. This classical theorem can be stated as follows. 

Theorem (Breuer-Major theorem). Let (Xk)kez be a doubly-infinite 
sequence of (jointly Gaussian) standard normal random variables, and let 
p(k) = E(A'oXfc) denote the covariance function. Suppose there is an integer 
n > 1 such that Ylkez \p(k)\ n < 00 • Let H n denote the nth Hermite polyno- 
mial, 

H n (x) = (-lTe* 2 / 2 £;e- x2 / 2 . 

({H n :n> 0} are the monic orthogonal polynomials associated to the law 
N(0,1).) Then the sequence 

j m— 1 

V m = — F = y^H n (X k )^N(0,nla 2 ) asm ^00, 



m 

k=0 



where a 2 = Y,k& P( k T ■ 
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See, for example, the preprint [25] for extensions and quantitative im- 
provements of this theorem. Note that the Hermite polynomial H n is related 
to Wiener integrals as follows: if (Wt)t>o is a standard Brownian motion, 
then W\ is a N(0, 1) variable, and 

H n (W 1 )=C(l mn ). 

(See, e.g., [19].) The function Irgun is fully symmetric. On the other hand, 
if (St)t>o is a free Brownian motion, then 

^71 C 1 [0,1]") = U n (Si), 

where U n is the nth Chebyshev polynomial of the second kind, defined (on 
[-2,2]) by 

sinffn + 1)9) 

(2.11) [/„(2 cos 0) = 



sm( 



({U n :n > 0} are the monic orthogonal polynomials associated to the law 
S(0, 1); see [8, 41].) Hence, the Wigner-Wiener transfer principle Theo- 
rem 1.8 immediately yields the following free Breuer-Major theorem. 

Corollary 2.3. Let (X k )k£Z be a doubly-infinite semicircular system 
random variables 5(0,1), and let p(k) = (p(XoXf;) denote the covariance 
function with Xq. Suppose there is an integer n>l such that ^2 ke % \p(k)\ n < 
oo. Then the sequence 

^ m—l 

V m = —= V] U n {X k ) -^>5(0,cj 2 ) asm^oo, 

V ; — n 



k=0 



where a 2 = J2k£zP( k Y 



3. Free stochastic calculus. In this section, we briefly outline the def- 
initions and properties of the main players in the free Malliavin calculus. 
We closely follow [8]. The ideas that led to the development of stochastic 
analysis in this context can be traced back to [18]; [9] provides an important 
application to the theory of free entropy. 

3.1. Abstract Wigner space. As in Nualart's treatise [27], we first set up 
the constructs of the Malliavin calculus in an abstract setting, then special- 
ize to the case of stochastic integrals. As discussed in Section 1.2, the free 
Brownian motion is canonically constructed on the free Fock space ^o(Sj) 
over a separable Hilbert space $). Refer to the algebra S{$S) [generated by the 
field variables X(h) for h €.$)], endowed with the vacuum expectation state 
ip, as an abstract Wigner space. While S(Sj) consists of operators on 
it can be identified as a subset of the Fock space due to the following fact. 
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Proposition 3.1. The function 



(3.1) 



is an injective isometry. It extends to an isometric isomorphism from the 
noncommutative L 2 -space L 2 (S (Sj) , tp) onto ^o(Sj). 

In fact, the action of the map in equation (3.1) can be explicitly written in 
terms of Chebyshev polynomials [introduced in equation (2.11)]. If {hi}i e jq 
is an orthonormal basis for fj, k±, ki, ■ ■ ■ , k r are indices with kj / kj+i for 
1 < j < r, and n\,...,n r are positive integers, then 



(3.2) U ni (X(h kl )) ■ ■ ■ U nr (X(h kr ))Q = hf^ ® • ■ • ® G ^ (Sj). 



[This is the precise analogue of the classical theorem with X(-) an isonor- 
mal Gaussian process and the U n replaced by Hermite polynomials H n \ in 
the classical case the tensor products are all symmetric, hence the disjoint 
neighbors condition on the indices k±,...,k r is unnecessary] Hence, in order 
to define a gradient operator (an analogue of the Cameron-Gross-Malliavin 
derivative) on the abstract Wigner space S(Sj), we may begin by defining it 
on the Fock space J^oC^))- 

3.2. Derivations, the gradient operator, and the divergence operator. In 
free probability, the notion of a derivative is replaced by a free difference 
quotient, which generalizes the following construction. Let u : 1R — > C be a C 1 
function. Then define a function du : M x M — > C by 



The function du is continuous on M 2 since u is C . This operation is a deriva- 
tion in the following sense (as the reader may readily verify): if u,v € C 1 (]R) 
then 



Hence, du G Lf oc (R 2 ) = L 2 oc (R) ® L 2 oc (R). In other words, we can think of d 



If we restrict d to polynomials u G C[X] in a single indeterminate, then 
du G C[X, Y], polynomials in two (commuting) variables, and the same iso- 
morphism yields C[X, Y] = C[X] <S> C[X]. The action of d can be succinctly 



(3.3) 




(3.4) 



d(uv)(x, y) = u(x)dv(x, y) + du{x, y)v{y). 



as a map 



(3.5) 



d:C\R)^L 2 oc {R)®Ll c {R). 
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expressed here as 

d:C[X\->C[X\®C[X\ f 

(3.6) 

n 

x n ^^x j - x ® X n ~ 3 . 

3=1 

The operator d is called the canonical derivation. In the context of equa- 
tion (3.6), the derivation property is properly expressed as follows: 

(3.7) d{AB) = (A®l)-dB + dA-{l®B). 

It is not hard to check that d is, up to scale, the unique such derivation which 
maps C[X] into C[X] <g>C[X] (i.e., the only derivations on R are multiples of 
the usual derivative). This uniqueness fails, of course, in higher dimensions. 

Free difference quotients are noncommutative multivariate generalizations 
of this operator d (acting, in particular, on noncommutative polynomials). 
The definition follows. 

Definition 3.2. Let g/ be a unital von Neumann algebra, and let 
X G si '. The free difference quotient dx in the direction X is the unique 
derivation [cf. equation (3.7)] with the property that dx(X) = 1 ® 1. 

(There is a more general notion of free difference quotients relative to 
a subalgebra, but we will not need it in the present paper.) Free difference 
quotients are central to the analysis of free entropy and free Fisher infor- 
mation (cf. [39, 40]). The operator d plays the role of the derivative in the 
version of Ito's formula that holds for the stochastic integrals discussed be- 
low in Section 3.3; cf. [8], Proposition 4.3.2. We will use d and dx, and their 
associated calculus (cf. [40]), in the calculations in Section 4.1. We mention 
them here to point out a counter-intuitive property of derivations in free 
probability: their range is a tensor-product space. 

Returning to abstract Wigner space, we now proceed to define a free 
analog of the Cameron-Gross-Malliavin derivative in this context; it will 
be modeled on the behavior (and hence tensor-product range space) of the 
derivation d. 

Definition 3.3. The gradient operator V : ^o(Sj) — > ^a{?))®f)® 
is densely defined as follows: VO = 0, and for vectors hi, ■ ■ ■ , h n G S), 

n 

(3.8) V(/ii <g> • ■ • ® h n ) = ^{hi ® • ■ • <g) hj-i) tg) hj ® (h j+1 ® ■ • ■ ® h n ), 

3=1 

where hi ® • • • Cg> hj-i = Q when j = 1 and /ij+i <8> • • • <8> h n = Q when j = n. 
In particular, V/i = (8> h ® J7. 
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The divergence operator 8:J^q($)) <S>fj <8> ^o(fi) — > &o(Sj) is densely de- 
fined as follows: if hi, . . . , h n and gi, . . . ,g m and h are in $), then 

5 ((hi ® • • • <g> h n ) ®h®(gi®---® g m )) 

(3.9) 

= hi®---®h n ®h®gi®---® g m . 

These actions, on first glance, look trivial; the important point is the 
range of V and the domain of 5 are tensor products, and so the placement 
of the parentheses in equations (3.8) and (3.9) is very important. When we 
reinterpret V, 5 in terms of their action on stochastic integrals, they will 
seem more natural and familiar. 

The operator Nq = <5V:«^o(-£)) ~~ > ■^o(£j) is the free Ornstein-Uhlenbeck 
operator or free number operator; cf. [5]. Its action on an n-tensor is given by 
No(hi ® • • • <B> h n ) = nhi ® • • • ®h n . In particular, the free Ornstein-Uhlenbeck 
operator, densely defined on its natural domain, is invertible on the orthog- 
onal complement of the vacuum vector. This will be important in Section 4. 
It is easy to describe the domains &(Nq) and ^(N^ 1 ); we will delay these 
descriptions until Section 3.6. 

Definition 3.3 defines V, 5 on domains involving the algebraic Fock 
space ^algC^j) (consisting of finitely-terminating sums of tensor products 
of vectors in Sj). It is then straightforward to show that they are closable 
operators, adjoint to each other. The preimage of ^aig(^) under the isomor- 
phism of equation (3.1) is actually contained in S(Sj): Equation (3.2) shows 
that it consists of noncommutative polynomials in variables {X(h),h € fj}. 
Denote this space as 5 a i g (•£})• We will concern ourselves primarily with the 
actions of V, 5 on this polynomial algebra (as is typical in the classical setting 
as well). Note, we actually identify 5 a i g (i}) as a subset of JP(Sj) via Proposi- 
tion 3.1, therefore using the same symbols V, 5 for the conjugated actions of 
these Fock space operators. Under this isomorphism, the full domain £^(V) 
is the closure of <S a i g (io); similarly, @(Nq) and 3i(Nq 1 ) have 5 a i g (fj) (minus 
constants in the latter case) as a core. 

Proposition 3.4. The gradient operator V:S a i g (i}) — >■ <S a ig(-£>) ® ft <8> 
S a lg(i5) is a derivation. 

(3.10) V(AB)=A-(VB) + (VA)-B, A,B e S a i g (io). 

In equation (3.10), the left and right actions of 5 a i g (fj) are the obvious 
ones A-(U®h®V) = (AU)<g>h®V and (U ®h®V) ■ B = U®h®(VB). This 
is the same derivation property as in equation (3.7). In particular, iterating 
equation (3.10) yields the formula 

n 

(3.11) V(X(hi) ■ ■■X(h n )) = Y^X(hi) ■ ■■X(hj-i)®h j ®X(h j+ i) ■ ■ ■ X(h n ). 

3=1 
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When n = 1, equation (3.11) says VX(h) = 1 £?> h (8) 1, which matches the 
classical gradient operator (up to the additional tensor product with 1). 

As shown in [8], both operators V and 5 are densely defined and closable 
operators, both with respect to the L 2 (ip) [or L 2 (tp® tp)] topology and the 
weak operator topology. It is most convenient to work with them on the 
dense domains given in terms of 5 a i g . 

We now state the standard integration by parts formula. First, we need 
an appropriate pairing between the range of V and $) , which is given by the 
linear extension of the following. 

{•, -}j5 : (<Salg(£) <8> S) ® SalgGft)) X Sj -> cS a lg(i5) (g> SalgCS), 

(3.12) 

(A ® /ix <8> £?, /i 2 )^ = (hi,h 2 )A <8> B. 

In the special case f) = L 2 (IR + ) to which we soon restrict, this pairing is 
quite natural; see equation (3.17) below. The next proposition appears as [8], 
Lemma 5.2.2. 

Proposition 3.5 (Biane, Speicher). IfY G <S a lgCfi) andheSj, 

(3.13) <p®<p((VY,h)si) = <p(Y-X(h)). 

Remark 3.6. Since (VY, h)^ is in the tensor product <S a igC#) ® >?aig (•£>) > 
its expectation must be taken with respect to the product measure ip® (p. 

3.3. Free stochastic integration and biprocesses. We now specialize to 
the case f) = L 2 (R + ). In this setting, we have already studied well the field 
variables X{h). 

(3.14) X{h) = if (fc) = J h(t) dS t . 

[Equation (3.14) follows easly from the construction St = X(lr £i) of free 
Brownian motion.] To improve readability, we refer to the polynomial alge- 
bra 5 a ig(-L 2 (lR+)) simply as 5 a i g ; therefore, since St = X(1i a), 5 a i g contains 
all (noncommutative) polynomial functions of free Brownian motion. The 
gradient V maps 5 a ig into 5 a i g (8>L 2 (IR + ) (8>iS a i g . It is convenient to identify 
the range space in the canonical way with vector- valued L 2 -functions. 

cSalg ® L 2 (R + ) (8) 5alg L 2 (M + ;5 a lg ® Sag). 

That is, for Y £ S a lg> we may think of VY as a function. As usual, for t > 0, 
denote (VY)(t) = V*Y. Thus, VY is a noncommutative stochastic process 
taking values in the tensor product 5 a ig <8>5 a ig- 

Definition 3.7. Let (s/,(p) be a W* -probability space. A biprocess is 
a stochastic process tt-^Ut€ s/ (8> si ' . For 1 < p < oo, say U is an L p bipro- 
cess, U € 88 p, if the norm 

roc 

(^•15) \W\\sS p = I \\Ut\\LP(sa?'<g)*?,<p®<p)dt 
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is finite. (When p = oo the inside norm is just the operator norm of Ut in 
si® si.) 

Let {g/t - t>0} be a filtration of subalgebras of si ; say that U is adapted 
if U t esit® si t for all t > 0. 

A biprocess is called simple if it is of the form 

n 

(3-16) U = Y,A ] ®B J l [tj _ utj) , 

i=i 

where = to < t\ < ■ ■ ■ < t n and Aj,Bj are in the algebra si. The simple 
biprocess in equation (3.16) is adapted if and only if Aj,Bj S = e 4 3 -_i for 
1 < J < The closure of the space of simple biprocesses in ^ p is denoted £8^, 
the space of L p adapted biprocesses. 

Remark 3.8. Customarily, our algebra si will contain a free Brownian 
motion S = (St)t>o, and we will consider only nitrations s/ t such that S s S s/ t 
for s < t. Thus, when we say a process or biprocess is adapted, we typically 
mean with respect to the free Brownian filtration. 

So, if Y e <Sai g , then VY is a biprocess. Since 5 a i g consists of polynomials 
in free Brownian motion, it is not too hard to see that VY £ S$ v for any 
p > 1 (cf. [8], Proposition 5.2.3). Note that the pairing of equation (3.12), in 
the case S) = L 2 (M + ), amounts to the following. If U G 9&i is an 1? biprocess 
and he L 2 (R + ), then 

(3.17) (U,h) LHR+) = [ UMt)dt. 

We now describe a generalization of the Wigner integral j h(t) dSt to allow 
"random" integrands; moreover, we will allow integrands that are not only 
processes but biprocesses. (If X t is a process, then X t <g> 1 is a biprocess, so 
we develop the theory only for biprocesses.) 

Definition 3.9. Let U = J2]=i Aj ® Bjl^._ utj ^ be a simple biprocess, 
and let S = (St)t>o be a free Brownian motion. The stochastic integral of U 
with respect to S is defined to be 

/n 
U t WSt = Y, A i(St j -S t ._ 1 )B j . 

Remark 3.10. The JJ-sign is used to denote the action of Ut on both 
the left and the right of the Brownian increment. In general, we use it to 
denote the action of si ® stf on by (A <g> B)$C = ACB; more generally, 
for any vector space SC , it denotes the action of si % si on s/ ® SC ® si by 
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(A <g> B)$(C ®X®D) = (AC) ig) X ig> (-DB). Since the second tensor factor 
of =e/ acts on the right rather than the left, it might be more accurate to 
describe j) as an action of srf ®£/ op , where the opposite algebra g/ op is equal 
to £? as a set but has the reversed product. 

Remark 3.11. Let U be a simple biprocess as in equation (3.16). If Ay- 
are constant multiples of the identity, and Bj = 1, then the stochastic in- 
tegral in Definition 3.9 reduces to the Wigner integral, jUt^dSt = lf(h) 
where h = £"=i Ajl [t ._ ut .y 

Let U be an adapted simple biprocess. A standard calculation, utiliz- 
ing the freeness of the increments of (St)t>o, yields the general Wigner-Ito 
isometry, 



(3.19) 



UAdSt 



W\\.<%2- 

This isometry therefore extends the definition of the stochastic integral to 
all of S$2 by a density argument (since simple biprocesses are dense in SB^ )• 

3.4. An ltd formula. There is a rich theory of free stochastic differential 
equations based on the stochastic integral of Definition 3.9 (cf. [11-13]) which 
mirror classical processes (like the Ornstein-Uhlenbeck process) in the free 
world, and [14] which uses free SDEs for an important application to ran- 
dom matrix ensembles and operator algebras. The stochastic calculus in this 
context is based on a free version of the ltd formula, [8], Proposition 4.3.4. It 
involves the derivation d in place of the first order term; in order to describe 
the appropriate ltd correction term, we need the following definition. 

Definition 3.12. Let ji be a probability measure on R all of whose 
moments are finite. Define the operator A^iCfX] — > C[X] on polynomials 
as follows: 

(3.20) A„h(x)=2^- / dh(x,y)n(dy). 

dx J R 

The ltd formula in our context applies to ltd processes of the form Mf = 
M + Jo U 8 i dS s + £ K s ds. For our purposes, it suffices to take U s = l[ 0;t ]l®l 
so that the stochastic integral J* U s § dS s is just the free Brownian motion St, 
and so we state the formula only in in this special case. 

Proposition 3.13 (Biane, Speicher). Let K = (K t )t>o be a self-adjoint 
adapted process. Let Mq be self adjoint in L 2 (S,(p), and let M = (Mt)t>o be 
a process of the form 

r-t 

K. q ds. 

'o 



(3.21) M t = M + S t + I 

Jo 
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Let h € C[X] be a polynomial, and let At denote the operator At = A^ Mt ; 
cf. equation (3.20). Then 

(3.22) h{M t ) = h(M )+ [ dh(M 8 )$dM s + \ I A s h(M s )ds. 

Jo * Jo 

Remark 3.14. In equation (3.22), we are viewing the function dh as 
living in C[X] <S>C[X] directly rather than C[X, Y]. In particular, if h(x) = x n 
then dh(X) = 52k=i xk ~ 1 ® xn ~ k - 

Remark 3.15. Of course, given equation (3.21) defining Mj, the integral 
J Q dh(M s )$dM s in equation (3.22) is shorthand for 
d r-t r-t 

dh(M s )$dM s = / dh{M s )$dS s + / dh(M s )$K s ds, 



10 Jo Jo 

following standard conventions of stochastic calculus. 

We will use Proposition 3.13 in the calculations in Section 4.1 below. It 
will be convenient to extend the Ito formula beyond polynomial functions h 
for this purpose. The canonical derivation d of equation (3.4) makes sense 
for any C 1 -function h; we restrict this domain slightly as follows. Suppose 
that h is the Fourier transform of a complex measure v on R, 

(3.23) h{x) = v{x) = I e ix ^v(di). 



By definition, a complex measure is finite, and so such functions h are con- 
tinuous and bounded, h € Cfe(M). In order to fit into the Ito framework, 
such functions must be L 2 in the appropriate sense. In the context of equa- 
tion (3.23), the relevant normalization is as follows. 

Definition 3.16. Let h have a Fourier expansion as in equation (3.23). 
Define a seminorm ^(h) on such functions h by 

(3.24) ,Mh)= [ e\v\(dt). 

Jk 

Denote by C2 the set of functions h with J^(^) < 00. 

Remark 3.17. J?2 is not a norm: if h = a € C is a constant function, 
then h = ado, and ^i{h) = f £ 2 |a|<5o(d£) = 0. It is easy to check that J^2 is 
a seminorm (i.e., nonnegative and satisfies the triangle inequality), and that 
its kernel consists exactly of constant functions in C2. Indeed, the quotient 
of C2 by constants is a Banach space in the descended ^-norm. 

Standard Fourier analysis shows that C2 C C?(M) (bounded twice-contin- 
uously-differentiable functions), where ^(h) is like a sup-norm on the sec- 
ond derivative h". In particular, nonconstant polynomials are not in C2. For 
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our purposes, we are only concerned with applying polynomials to bounded 
operators, meaning that we only care about their action on a compact subset 
of R. In fact, locally any C°° function is in 62- 

Lemma 3.18. Let r > 0. Given any C°° function h:M.—>C, there is 
a function h r E C2 such that h(x) = h r (x) for \x\ < r. 

Proof. Let ip r be a function such that ip r {x) = 1 for |x| < r. Then ip r h 
is equal to h on [— r, r]. This function is , and hence its inverse Fourier 
transform {ip T h) y is in the Schwartz space of rapidly-decaying smooth func- 
tions. Set v r {d£) = (ip r h) v (£) <i£; then u r has finite absolute moments of all 
orders, and h r = v r = ip r h is in C2 and is equal to h on [— r, r\. □ 

In particular, polynomials are locally in the class 62- Later we will need 
the following result which says that resolvent functions are globally in C2. 

Lemma 3.19. For any fixed z in the upper half-plane C+, the function 
Pz(x) = tr^ is in C 2 . 

Proof. The resolvent p z is the Fourier transform of the measure v z {d£,) = 
_i e —iz£, ±, oo Q i (^) d£; a simple calculation shows that J^(p^) = 2(^sz)~ 3 when 
%z>0. □ 

The next theorem is a technical approximation tool which will greatly 
simplify some of the more intricate calculations in Section 4.1. 

K P 

Theorem 3.20. Let K be a compact interval in R. Denote by C 2 ' the 
subset of C2 consisting of those functions ^C-^ that are equal to polynomials 
on K. If /i G C2, there is a sequence h n S C 2 ' such that: 

(1) ^(hn) ->■ / 2 (ft) asn-^oo; 

(2) if [i is any probability measure supported in K , then j h n d[i^r J hdfi. 

In fact, our proof will actually construct such a sequence h n that con- 
verges to h pointwise as well, although this is not necessary for our intended 
applications. The proof of Theorem 3.20 is quite technical, and is delayed 
to Appendix. 

Since C2 C C 2 (]R), the operator d makes perfect sense on C2 (and has L 2 - 
norm appropriately controlled); we can then reinterpret the function dh £ 
C^M 2 ) as an element of Li oc (R) ® L 2 oc (R) so it fits the notation of the 
Ito formula equation (3.22). It will be useful to have a more tensor-explicit 
representation of the function dh for h £ C2 in the sequel. If h = v, then 



(3.25) 
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Under the standard tensor identification, we can rewrite equation (3.25) as 

(3.26) Oh(Y)= f [ i£(e ia t Y ®e i(1 - a) t Y )v(dOda. 

Jo Jr 

As for the Ito correction term in equation (3.22), two applications of the 
Dominated Convergence Theorem show that the operator A M of Defini- 
tion 3.12 is well defined on h G C2 whenever [i is compactly-supported, and 
the resulting function A^h is continuous. As such, all the terms in the Ito 
formula equation (3.22) are well defined for h G C2, and standard approxi- 
mations show the following. 

COROLLARY 3.21. The Ito formula of equation (3.22) holds for h G<?2- 

Remark 3.22. The evaluations of the functions h, dh, and Afh on 
the noncommutative random variables Mq and Mj are given sense through 
functional calculus; this is possible (and routine) because Mq and M t are 
self adjoint. 

3.5. Chaos expansion for biprocesses. Recall the multiple Wigner inte- 
grals 1% as discussed in Section 1.3. By de-emphasizing the explicit de- 
pendence on n, I s can then act (linearly) on finite sums ^ n f n of func- 
tions f n G L 2 (Wl) = L 2 (M + )® n ; that is, I s acts on the algebraic Fock space 
-^alg = ^alg(I 2 (R+))- Utilizing the Wigner isometry, equation (1.5), this 
means I extends to a map defined on the Fock space, 

(3.27) I s :^ ^L 2 (S,<p); 

here and in the sequel, ^ = ^o(£ 2 (IR+)) and S = S(L 2 (R + )). In fact, the 
map in equation (3.27) is an isometric isomorphism; this is one way to state 
the Wigner chaos decomposition. This extended map I s is the inverse of the 
map Y 1 — y Y£l of Proposition 3.1. 

For n,m positive integers, define for / G L 2 (R") <g> L 2 (R™) ^ L 2 (R" +m ) 
the Wigner bi-integral 

(3.28) [if®if](/) = J f(t 1 ,...,t n ;s 1 ,...,s m )dS tl ---dS tn ®dS Sl ---dS Sm . 

To be clear: if / = g ® h with g G L 2 (R™ ) and h G L 2 (R™), then [if ® 
if ](/) = if id) ® imWi i n general, if <8> if is the L 2 -closed linear extension 
of this action. Thus, 

if if : L 2 (R™ ) ® L 2 (R™) -> L 2 {S ®S,ip®<p). 

The Wigner isometry [cf. equation (1.5)] in this context then says that if 
/ G L 2 (M.l) ® L 2 (R™) and g G L 2 (R™') ® L 2 (R™'), then 

^®<^([if ®4'](5)*[if ®if](/)) 

(3.29) 

' (/>#)i 2 (H?p®l, 2 (K™) ; if n = n' and m = m', 
0, otherwise. 
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This "bisometry" allows us to put the 1^ <g> 1^ together for different n, m as 
in equation (3.27), to yield an isometric isomorphism 

(3.30) I s ®I s :<^ (g)^o->L 2 (S(g)S,tp(g>(p). 

What's more, by taking these Hilbert spaces as the ranges of vector-valued 
L 2 (M + )-functions, and utilizing the isomorphism L 2 (IR_|_; 21 <g> 05) = 21 <8> 
L 2 (M_|_) ® 05 for given Hilbert spaces 21, 05, we have an isometric isomor- 
phism 

(3.31) I s <g> I 5 : L 2 (R + ;^ ® &o) -> & 2 - 

Here ^2 denotes the L 2 biprocesses (cf. Definition 3.7), in this case taking 
values in S <g> S. If / G L 2 (R + ; J^o (8) =^o)> the bi-integral acts only on com- 
ponents: [J 5 ® I 5 ] (/)(£) = [I s ® I 5 ] (/ t ). Equation (3.31) [through the action 
defined in equation (3.28)] is the Wigner chaos expansion for L 2 biprocesses 
in the Wigner space. 

As in the classical case, adaptedness is easily understood in terms of the 
chaos expansion. If U G 38 1, it has a chaos expansion U= [I s ® I s ](f) for 
some / G L 2 (R + ; ^0 ® &o), which we may write as an orthogonal sum 

00 

f:t^f t = ft'™, 

n,m=0 

where /™' m G L 2 (M™ ) ® L 2 (]R™). Then [/ is adapted (in the sense of Defini- 
tion 3.7) if and only if for each n, m and t\, . . . , t n , s\, . . . , s m > 0, the ker- 
nels f™' m (ti, . . . ,t n ; s±, . . . , s m ) are adapted, meaning they are whenever 
max{ii, . . . ,t n , s±, . . . , s m } > t. In this case, the stochastic integral defined 
in equations (3.18) and (3.19) can be succinctly expressed; cf. [8], Proposi- 
tion 5.3.7. In particular, if / n ' m G L 2 (K+; L 2 (M") <g>L 2 (IR™)) is adapted, then 

[ [I s ®I s ](f t )#dS t 

(3.32) 

= / ft' m { t ii---i t n,si,...,s m )dS tl --- dS tn dS t dS Sl --- dS Sm . 

This is consistent with the notation of equation (3.28); informally, it says that 

(dS tl ■ ■ ■ dS tn ® dS Sl ■ ■ ■ dS Sm )$dS t = dS tn ■ ■ ■ dS tl dS t dS Sl ■ ■ ■ dS Sm 
as one would expect. 

3.6. Gradient and divergence revisited. Both the gradient and the di- 
vergence have simple representations in terms of the chaos expansions in 
Section 3.5. 

Proposition 3.23 (Propositions 5.3.9 and 5.3.10 in [8]). The gradient 
operator is densely -defined and closable in 

V:L 2 {S,ip)^m 2 . 
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Its domain £#(V), expressed in terms of the chaos expansion for L 2 (S,ip), 
s ( 

I s '(f), then Y G 9(V) if and only if 



s as follows. If / = En /n e with f n G L 2 (M.%), and if L 2 {S, tp) 3 Y 



(3.33) y^n||/ ra ||1 



/"lli 2 (R") 
n=0 



In this case, the quantity in equation (3.33) is equal to the norm 

„ oo 

/ H^^lli 2 ^®^®^) dt = y~]n\\fn\\l2 (R n ) . 
Moreover, the action of V on this domain is determined by 
V t (J f(t 1 ,...,t n )dS tl ---dS tn ) 

n „ 

-iiMfc+i) ■■■,t n ) dSt x ■ ■ • dSt k _ 1 <g> dSt k+1 ■ • • dSt n . 



(3.34) 



fc=i ' 



Remark 3.24. It is similarly straightforward to write the domain of the 
free Ornstein-Uhlenbeck operator in terms of Wigner chaos expansions. If 
Y = I s (f) where / = ^/n£ then Y G ®(N ) iff En™ 2 |l/nllI*(R») < 

oo. Likewise, Y G 9(N^ 1 ) iff f = and Era>o n ~ 2 |l/nll|2(R») < oo. In par- 
ticular, we see that 

(3.35) 9{N ) C 9(V), 0(V) e image(I 5 ) C 9(N l ). 

The divergence operator can also be simply described in terms of the 
chaos. We could similarly describe its domain, but its action on adapted 
processes is already well known, as in the classical case. 

Proposition 3.25 (Propositions 5.3.9 and 5.3.11 in [8]). The divergence 
operator is densely defined and closable in 

5:&2^L 2 (S,tp). 

Using the chaos expansion for biprocesses, the action of 5 is determined as 
follows. If f £L 2 (R + ;L 2 {Rl)®L 2 (R™)), then 



n ; si, . . . , s m ) dS tl ■ ■ ■ dS tm <g> dS Sl ■■■ dS s 

(3.36) U 

= / ft{ti,---,t n ;si,...,t m )dS tl --- dS tn dS t dS Sl --- dS Sm . 

In particular, comparing with equation (3.32), if U is an adapted biprocess 
U G 08%, then U G 9(5) and 



5(U) = f U4dS t . 
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Remark 3.26. In light of the second part of Proposition 3.25, the diver- 
gence operator is also called the free Skorohod integral. To be more precise, 
as in the classical case, there is a domain L 1 ' 2 in between and the nat- 
ural domain 3i{8) on which 8 is closable and such that for U € L 1 ' 2 the 
relation Vt(S(U)) = Ut + 5 s (VtU s ) holds true. It is this restriction of 5 that 
is properly called the Skorohod integral. 

Remark 3.27. Given a random variable X € ^(V), using the deriva- 
tion properties of the operators dx (cf. Definition 3.2) and V, it is rela- 
tively easy to derive the following chain rule. If p € C[X] is a polynomial, 
then 

(3.37) Vp(X) = d x p{X)p7X. 

We conclude this section with one final result. The space of I? adapted 
biprocesses SB^ is a closed subspace of the Hilbert space cf. Defini- 
tion 3.7. Hence there is an orthogonal projection Y:S$2 —> 3$2- The nex t 
result is a free version of the Clark-Ocone formula. It can be found as [8], 
Proposition 5.3.12. 



Proposition 3.28. If X € #(V), then 

x = tp(x) + 5(rvx). 



4. Quantitative bounds on the distance to the semicircular distribution. 

As described in the restricted form of Theorem 1.10 in Section 1, we are pri- 
marily concerned in this section with quantitative estimates for the following 
distance function on probability distributions. 

Definition 4.1. Given two self-adjoint random variables X, Y, define 
the distance 

d C2 (X, Y) = sup{|^[/i(X)] - ip[h(Y))\ :heC 2 , J?2{h) < 1}; 
the class C2 and the seminorm J*2 are discussed in Definition 3.16. 

Remark 4.2. Note that we could write the definition of d^ 2 (X,Y) 
equally well as 



sup 



h dux — h dfxy 



:heC 2 ,^2(h) < 1 



In this form, it is apparent that dc 2 (X,Y) only depends on the laws jix 
and jJLy of the random variables X and Y. In computing it, we are therefore 
free to make any simplifying assumption about the correlations of X and Y 
that are convenient; for example, we may assume that X and Y are freely 
independent. 
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Lemma 3.19 shows that resolvent functions p z (%) = (z — x) 1 are in C2 
for z € C+, and in fact that if Qz = 1, then ^{Pz) — 2. Thus, 

1 , „ r . 1 

2 



d C2 (X,y)>- sup \<p[(z-X)- 1 ]-<p[(z-Y)- 1 ]\ = - sup |G MA .(z)-G My (z)|, 



where G fl (z) = f R (z — x)~ 1 p(dx) is the Stieltjes transform of the law /z. It 
is a standard theorem that convergence in law is equivalent to convergence 
of the Stieltjes transform on any set with an accumulation point, and hence 
this latter distance metrizes converge in law; so our stronger distance dc 2 
also metrizes convergence in law. The class C2 is somewhat smaller than 
the space of Lipschitz functions, and so this metric is, a priori, weaker than 
the Wasserstein distance (as expressed in Kantorovich form; cf. [10, 33]). 
However, as Lemma 3.18 shows, all smooth functions are locally in C2] the 
relative strength of dc 2 versus the Wasserstein metric is an interesting ques- 
tion we leave to future investigation. 

4.1. Proof of Theorem 1.10. We begin by restating Theorem 1.10 in the 
language and full generality of Section 3. 

Theorem 1.10. Let S be a standard semicircular random variable; cf. 
equation (1-4)- Let F be self adjoint in the domain of the gradient, F € 
f(V)cL 2 (5,^), with<p(F) = 0. Then 

(4.1) dc 2 (F,S) <^<p<8xp 



J VsiN^F^VsFYds-l®! 



Proof. The main idea is to connect the random variables F and S 
through a free Brownian bridge, and control the differential along the path 
using free Malliavin calculus; cf. Section 3. For < t < 1, define 



(4.2) F t = VT=tF + s t , 

where St is a free Brownian motion. In particular, S\ has the same law as 
the random variable S. Since dc 2 (F, S) depends only on the laws of F and S 
individually, for convenience we will take St freely independent from F. 
Fix a function h 6 62- In the proceeding calculations, it will be useful to 
assume that h is a polynomial; however, polynomials are not in €2- Rather, 
fix a compact interval K in R that contains the spectrum of Ft for each 
t € [0, 1]; for example, since \\F t \\ < 2y/i + \/l — t\\F\\, we could choose K = 
[—2 — 11-^11,2 + ||-F||]. For the time being, we will assume that h is equal to 
a polynomial on K; that is, we take h€C 2 ' ; cf. Theorem 3.20. 

Define g(t) = <p[h(Ft)]. The fundamental theorem of calculus yields the 
desired quantity, 

(4.3) <p[h{S)] - (p[h(F)] = ^(Fi)] - <p[h(F )] = g(l) - g(Q) = f g'(t) dt. 

Jo 
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We can use the free ltd formula of equation (3.22) to calculate the deriva- 
tive g'(t). In particular, dF t = — ^-J==^ Fdt + dSt, and so applying equa- 
tion (3.22) yields 

d[h{F t )\ = dh(F t )$dF t + ±Ath(F t )dt 

( 4 - 4 ) 

1 „ . .„ 1 1 



= dh(F t W-—^==Fdt + dS t [ j + -A t h(F t ) dt. 

Linearity (and uniform boundedness of all terms) allows us to exchange <p 
with stochastic integrals; in particular, we may write dg(t) = ip(d[h(F t )]) . 
The (stochastic integral of the) term dh(Ft)$dSt has mean 0, and so we are 
left with two terms, 

(4.5) dg(t) = i!--J=<p[dh(F t )V?\ + tp[A t h(F t )]\dt. 

The following lemma allows us to simplify these terms. 

Lemma 4.3. Let X and Y be self-adjoint random variables. Let h £ ^2- 

(a) <p[dh(y)pC\=<p[h'(Y)X\. 

(b) <p[A ltY h(Y)]=<p®<p[dh'(Y)]. 

Proof of Lemma 4.3. By assumption h takes the form h = v for some 
complex measure v with finite second absolute moment. 

(a) We use the representation of equation (3.26) for d, so that 

dh{Y)$X= I da I iiv{d£){e ia( > Y ® e i{l ~ a)liY )$X 
Jo Jr 



(4.6) 



f 1 da [ i^(dOe ia ^ Y Xe i{1 - a ^ Y . 
Jo Jr 



Since ip is a trace, (p[e ia ^ Y Xe^ 1 a ^ Y ] = Lp[e l ^ ¥ X]. Taking cp of both sides of 
equation (4.6), the a integration just yields a constant 1, and so 

(4.7) <p[dh(Y)pC\= [ i^(dO<p[e^ Y X] = J( [ i^ Y u(do)x 
Jr L\Jr J 

Since h'(x) = f m i^e l ^ x u(d^), this yields the result. 

(b) By Definition 3.12, A flY h(x) = 2^ J R dh(x, y)//y(dy). Using the chain 

rule, we can express dh(x, y) = h'(ax + (1 — a)y) da. Since h € C 2 and the 
integrand is bounded, we can rewrite A^hix) as 

A„„h(x) = 2— [ ayidy) [ dah! (ax + (1 — a)y) 
dx J R J 



WIGNER CHAOS AND THE FOURTH MOMENT 43 

(48) - f 1 

fj,y(dy) / 2adah"(ax + (1 — a)y). 



o 



Now h"{x) = / R -£ 2 e* x v(d£), and so 

A flY h(x) = - [ fv(d£) C 2ada [ fi Y (dy)e i( - 1 ~ a ^ y e ia ^ x 
Jr Jo Jr 



(4.9) 



fv(d£) f 1 2adae ia ^ x ip[e l(1 ^ Y } 
Jo 



Evaluating at x = Y and taking the trace, this yields 

(4.10) <p[A^h(Y)}=- [ fv(dO f 2ada ( p[e i ^ Y Me< 1 - a ^ Y ]. 



On the other hand, following the same identification as in equation (3.26), 
we have 



(4.11) dh'(Y) = - [ da I' t 2 v{d£)e 

Jo Jr 



iaZY ^ e i(l-a)£y _ 



Taking the trace yields 

(4.12) <p®<p[dh'(Y)] = - f da [ fu(dOf[e ia ^ Y Me i(1 ^ Y }. 

Jo Jr 

Subtracting equation (4.11) from equation (4.12) and using Fubini's theorem 
(justified since the modulus of the integrand is < £ 2 which is in L l (v x [0, 1])) 
yields 

tp®ip[dti(y)]-tp[^ Y h{Y)] 

(4.13) 

{2a -I) da [ fu{dCMe ia ^ Y Me i{1 - a ^ Y }. 



Equation (4.13) expresses the difference tp® tp[dh'(Y)] — f[A^ Y h(Y)] as an 
integral of the form j (2a — l)n(a) da, where k is a function with the sym- 
metry k(oc) = k(1 — a). The substitution a i— >■ 1 — a shows that any such 
integral is 0, which yields the result. □ 

We now apply Lemma 4.3 to equation (4.5) with X = F and Y = Ft; 
note that A t h(F t ) is by definition (cf. Proposition 3.13) equal to h(Ft). 
Equation (4.5) then becomes 



(4.14) 



g \t) = \{--^ t rtti{F t )F] + v®v[dh\F t ))V 
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At this point, we invoke the free Malliavin calculus of variations (cf. Sec- 
tion 3) to re-express these two terms. For the first term, we use a standard 
trick to introduce conditional expectation; by Definition 1.13, <p[h'(Ft)F] = 
(p[F-ip[h'(F t )\F]\. Since F G 3>(V) and tp(F) = 0, equation (3.35) shows that 
F G ^(iVo" 1 ), and so F = «5(VAT _1 F). Hence 

(4.15) <p[ti(F t )F] = <p[Fti(F t )] = tpWVN^F) ■ <p[h' (F t )\F]} . 

The right-hand-side of equation (4.15) is the I? (S, ip) -inner-product of 
diVN^F) with ip[ti(F t )\F]* = <p[h/(F t )\F] (since F and F t are self adjoint), 
and this random variable is in the domain @(V). Hence, since 5 and V are 
adjoint to each other, we have 

<p[ti(F t )F] = {VN Q l F,VvW{F t )\F}) S2 

(4.16) 

= / y®ip{V s {N^F)i{V s y\h>{F t )\F}T)ds. 

To be clear: Jj is the product (A x ® B l )^(A 2 ® B 2 ) = {AiA 2 ) <8> (B 2 Bi). It 
is easy to check that this product is associative and distributive, as will be 
needed in the following. 

Recall that Ft = y \ — tF + St and hi is equal to a polynomial on a com- 
pact interval K which contains the spectrum of Ft. Hence, h'(Ft) is a (non- 
commutative) polynomial in F and St- Thus, the conditional expectation 
ip[h f (Ft)\F] is a polynomial p(F) in F. We may thus employ the chain rule 
of equation (3.37) to find that, for each s, 

(4.17) V,<pft(F t )\F] = d F <p[hl(F t )\F}$V s F. 
Taking adjoints yields 

(4.18) (V s <p[hl(F t )\F])* = {V s F)*$d F <p[ti(F t )\F]. 

Now we use the intertwining property of the free difference quotient for 
the sum of free random variables with respect to conditional expectation 
(see, [40], Proposition 2.3) and the simple scaling property d a x = a~ 1 dx 
(for a G C) to get 

d F <p[h'(F t )\F] = d F ip[ti(VT=tF + S t )\F] 

= Vl^td^p^h' {Vl^tF + S t )\F] 

(4.19) 

= Vl^tip ® tpid^F+^h'iVT^iF + S t )\F] 
= VT=i<p®<p[dti(F t )\F]. 

Remark 4.4. It is here, and only here, that the assumption that St is 
free from the F is required. 



WIGNER CHAOS AND THE FOURTH MOMENT 45 

Combining equation (4.19) with equations (4.16) and (4.18) yields 
<p[h'(F t )F] 

(4.20) 

= VT^tcp®v( f V s {Nv 1 F)$(V s F)*ds$<p®<p[dti(F t )\F]\. 



As for the second term in equation (4.14), using property (3) of conditional 
expectation (cf. Definition 1.13) and taking expectations, we express 

(4.21) ip ® tp[dti(F t )] =tp®v(v® <p[dti(F t )\F]). 

Combining equations (4.14), (4.20) and (4.21) yields 

9'(t) = -\^p 



(4.22) 



tp\ I V s {NQ 1 F)%{V s F)*dsfo®v[dti{F t )\F] 
V®y[dti{F t )\F] 



1 



J VsiN^F^VsF)* ds - 1 ® lj $<p ® <p[dti(F t )\F] | . 



2 

Integrating with respect to t and using equation (4.3) gives 
<p[h(S)]-<p[h(F)} 

(4.23) = ~<p 



® ip 



J VsiN^FMVsF)* ds- <p®<p[dti(F t )\F}dt\. 



Applying the noncommutative L^-L 00 Holder inequality (which holds for 
the product jj on the algebra S since (j is really just the natural product on 
the algebra S ® 5 op ; cf. Remark 3.10) gives us 

\<p[h(F)]-<p[h(S)]\ 
(4.24) <^<f<S><p 



j VsiN^FMVsFYds-l®! 

[ \\<p®<p[dti(F t )\F]\\ S9S dt. 
Jo 



The norm || • \\s®s is the operator (L°°) norm on the doubled abstract Wigner 
space. The conditional expectation is an L°° -contraction [cf. property (2) in 
Definition 1.13], and so the second term in equation (4.24) satisfies 

(4.25) f \\<P®<p[dh'(F t )\F]\\ S9S dt< [ \\dti(F t )\\ S9 sdt. 
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Using equation (4.11) with Y = Ft, note that 

\\dti(F t )\\ s ® s = ['da [ fu(dOe 
Jo ii 

(4.26) 



< da fv{dZ)\\^ Ft \\s\\e i(1 - a) t Ft \\ s . 
Jo Jr 

Both of the norm terms in the second line of equation (4.26) are equal to 1 
since F t is self adjoint. This shows that \\dh! (F t )\\s®s < ^i{h)- Combining 
this with equations (4.24) and (4.25) yields 



(4.27) 



\<p[h{F)]-<P[KS)\\ 



J v s {N l F%y s F)*ds -i^i 



i(h). 



Inequality (4.27) is close to the desired result, but as proved it only holds for 
h£C 2 ' ■ Now take any h € C2 , and fix an approximating sequence h n GC 2 ' 
as guaranteed by Theorem 3.20. That theorem shows that J^(^n) — > ^{h), 
while 



\<p[h n (F)]-<p[h n (S)}\ 



J h n d^ F - J K dps 
v[h(F)}-v[h(S)}\ 



hdfis 



as n — > 00, since the supports of //p and fis are contained in K. This shows 
that inequality (4.27) actually holds for all h£C2, and this concludes the 
proof. □ 



Remark 4.5. In equation (4.15), instead of using the Ornstein-Uhlenbeck 
operator, we might have used the Clark-Ocone formula (Proposition 3.28). 
Tracking this through the remainder of the proof would yield the related 
estimate 



(4.28) d C2 {F,S)<-<p®v 



f T(y s F)$(V s F)*ds-l®l 



This estimate is, in many instances, equivalent to equation (4.1) as far as 
convergence to the semicircular law is concerned, as we discuss in Section 4.2; 
the formulation of equation (4.1) is ideally suited to prove Corollary 1.12, 
which is why we have chosen this presentation. 



4.2. Distance estimates. We begin by proving Corollary 1.12, which we 
restate here for convenience with a little more detail. 
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Corollary 1.12. Let f e L 2 (R 2 + ) be mirror- symmetric and normalized 
L 2 (R n ) = 1; an d S be a standard semicircular random variable. Then 



Proof. We will utilize the estimate of Theorem 1.10 applied to the ran- 
dom variable F = I 2 {f) [which is indeed centred and in the domain ^(V)]. 
Note, from the definition, that N^ 1 F = for a double integral. From 
equation (3.34), we have 

(4.29) V t F = V t ll if) = J f(t,t 2 )l®dS t2 + J f(h,t)dS tl ®l. 
Using the fact that f = f* , this yields 

(4.30) (V t F)* = J f(t 2 ,t)l®dS t2 + J /(Ml) ^01. 

(Note: the adjoint on tensor-product operators is, as one would expect, 
(A (g> B)* = A* <2> B* , contrary to the convention on page 379 in [8].) When 
multiplying equations (4.29) and (4.30), one must keep in mind the product 
formula (1.7) for Wigner integrals; in this context of Wigner bi-integrals, 
the results are 



(4.31) 



(4.32) 



(4.33) 



(4.34) 



f(t,s 2 )l®dS s AMJ f(t 2 ,t)l®dS t2 

= f f(t,s 2 )f(t 2 ,t)l®dS t2 dS S2 + f f(t,s)f(s,t)dsl®l, 
f(t,s 2 )l®dS S2 \U ffatJdSt^l 
f(t,s 2 )f(t,ti)dS tl <3dS S2 , 

j /( S1 ,t)ds sl ®iWy f(t 2 ,t)i®ds t . 

f(s 1 ,t)f(t 2 ,t)dS Sl ®dS t2 , 
J f( Sl ,t)dS sl ®l\U f(t,h)dS tl ®l 

f(s 1 ,t)f(t,t 1 )dS Sl dS tl ®l + J f(s,t)f(t,s)dsl®l. 
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Integrating with respect to t and using the identity f(s,t) = f(t,s), we then 
have 

2 J V t {N l F)%(V t Fydt = J J f(t,s 2 )f(t 2 ,t)dtl®dS t2 dS S2 

f(t,s 2 )f(t,t l )dtdS tl ®dS S2 



(4.35) + JJ f( Sl ,t)f(t 2 ,t)dtdS Sl ®dS t2 

+ j j /(si,t)/(t,ti)dtd5 si d5 tl ®l 

+ 2 j \f(s,t)\ 2 dtdsl®l. 

Now using the normalization ||/||£2(r2) = 1, and making use of contraction 
notation (cf. Definition 1.21), we have 

(4.36) = Jj^J(s 2 ,t 2 )l®dS t2 dS S2 + Jj^f{s 2 M)dS tl ®dS S2 

+ J f ^J(sut 2 )dS sl ®dS t2 + J f^f(s 1 ,t 1 )dS Sl dS tl ®l. 

We now employ Theorem 1.10. Equation (4.1) states that 

(4.37) d C2 (F,S) < illMAV^tKVF)^)^) - 1 ® l\\ L i iS9S>v9lp) . 

In any W* -probability space, || • H^i < || • \\ L 2; we will estimate the L 2 (5®<S, 
(p®<p) norm. It is useful to relabel the indices in equation (4.36) and group 
them according to different orders of (bi) chaos; the right-hand side of that 
equation is equal to 

/ 7~f(t2,ti)l®dS tl dS t2 

+ J [f X f{t2M) + f - J{tiM)]dS tl ®dS t2 

+ f f^f(h,t 2 )ds tl ds t2 ®i. 

A simple calculation using the fact that f = f* shows that / ^ f(t 2 ,t\) = 
f f{t\,t 2 ). The three integrals above are in orthogonal orders of chaos; 
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employing the Wigner bisometry 3.29, we have 
4||<V(AV 1 F)tt(VF)%l R+ ) L2(M+) - 1 ® 

(4.38) 

= 117 A 7lli 2( ^) + ll7-/ + /-7lli 2(K2+) + 11/- f\\l mly 

Another simple calculation, again using the identity f(s,t) = f(t,s), shows 
that 

(4.39) =||7A7||| 2(]R+)2 = f dtd Sl ds 2 \f(t, Sl )\ 2 \f(t, S2 )\ 2 , 
while 

(4.40) ||/-7lli 2(R2+) = H7-/lli 2(R+)2 = f R3 dtd Sl d S2 f(t, Sl ) 2 W^) 2 . 

Hence ||/ ^ 7\\l^(U 2 + ) = 117 — fWtffr*.) ^ 11/ — /Hl 2 (R+)- Usin g the triangle 
inequality in equation (4.38) then gives us the estimate 

\\{V(N^FMVFy,l R+ ) L2{R+) - 1 ® if^ S9StV9lp) 

(4.41) 

- 2 II/ — /Hl 2 (M^)' 

and so equation (4.37) and the ensuing discussion imply 

(4.42) d C2 {F, S) = d C2 (l!(f),S) < ||/ A /H^). 
Now, as calculated in equation (2.3) (in this instance with n = 2), 

(4.43) ^2 5 (/) 4 ) = 2 + ||/^/||i 2(M2h) . 
Equations (4.42) and (4.43) together conclude the proof. □ 

Remark 4.6. At first glance it might seem that calculations like those in 
the proof of Corollary 1.12 could be employed to prove similar quantitative 
results for Wigner integrals of arbitrary order n > 2. Note, however, that 
the mirror symmetry of / was used in different ways at several points in the 
above proof. In practice, if one tries to generalize these techniques to J3 , in 
fact / must be fully symmetric. The range of 1^ on fully symmetric functions 
is a very small subspace of the full nth Wigner chaos, and so we do not have 
quantitative bounds for generic higher integrals. 

Remark 4.7. As a quick illustration, we use the first inequality in 
Corollary 1.12 to refine Corollary 2.3 in the case n = 2 and the random 
variables are freely independent 5(0, 1) random variables; in particular, 
p(k) = dko- In this case, one can take these random variables to be such that 
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X k = S k +i - Sk, k > 0, so that V m = if (f m ), with 

^ m—1 

fm{x,y) = —j= ^2 1 (k,k+l](x)l{ k ,k+l]{y)- 



k=0 



Elementary computations now yield ||/ m ^> /m||z,2(jj2 j = 1/y/m, and there- 
fore 



dc*{V m ,S)<-d—. , 
2 V 2to 

which is consistent with usual Berry-Esseen estimates. 

In light of Theorem 1.3, the proof of Corollary 1.12 shows that convergence 
of the quantity on the right-hand side of equation (4.1) to is equivalent 
to convergence of F to S in law, at least in the case of double Wigner inte- 
grals. We conclude this paper with a collection of other equivalences, stated 
in terms of the gradient operator, in the class of double Wigner integrals; 
whether they hold for higher orders, or more generally on the domain f^(V), 
is left as an open question for further investigation. To simplify matters, we 
restrict to the real case for the following. 

Theorem 4.8. Let {fk)k&i be a sequence of fully symmetric functions 
in L 2 (M.+ ), each normalized ||/fc||L 2 (R + ) 2 = 1> an d se ^ Fk = ^2 (/&)• Then for 
each k 

F k = 5(VN 1 F k ) = S(TVF k ). 
Moreover, the following four conditions are equivalent: 

(1) F k converges in law to the standard semicircular distribution S(0,1); 

(2) / V t (7V " 1 F A .)t)(V t F fc )* dt converges to 1 ® 1 in L 2 (S ® S, <p <g> ip); 

(3) / r(V t F fc )t)(V t F fc )* dt converges to I® 1 in L 2 (S ®S,ip® ip); 

(4) f((rV t F k ,TV t F k )}dt converges to 1 in L 2 (S,ip). 

The pairing ((■, •)) : (S ® S) 2 -> 5 is defined by ((X, Y)) = (1 5 ® ^[XtfF]. 
For example, 

/(tiJlOdSi,, / ^2)1®^))= / f(t 1 )g(t 2 )dS t2 dS tl + [ f(t)g(t)dt, 



where we have used the product formula of equation (1.7). On the other 
hand, ((1 <g> dS tl , dS t2 ® 1)) = since <p(dSi) = 0. 

Proof. Equations (4.38) and (4.43) in the proof of Corollary 1.12 show 
that, in the case that / is real-valued, 

2 



J V t {N^F)^V t F) dt-l®l 
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where F = I 2 {f)- In light of Theorem 1.3, this proves the equivalence (1) <^=^ 
(2). The bound 4.28 shows that (3) =>• (1), and so to prove the equivalence 
of (1) and (3) it suffices (due to Theorem 1.6) to prove that the condition 

fk ^ fk ~ * implies (3). To that end, we adopt the standard notation x\/y = 
max{x, y} and x A y = min{x, y}. The following identity is easily proved: 

fk{x,t)f k (y,t)dt) dxdy 



, i xVy / 
(4.44) 

r>oo / rxAy \ 2 



fk(x,t)fk(y,t)dt) dxdy. 
o Vo / 

The following equivalence was proved in [29]. 

fk^fk^O mL 2 (Rl) iff 

fxAy \ 2 

fk(x,t)fk(y,t)dt) dxdy-tO. 



Note that equations (4.44) and (4.45) imply that, if f k ^> f k — > 0, then the 
three functions 

xAy roo 

fk(x,t)f k (y,t)dt, / f k (x,t)f k (y,t)dt, 

JxVy 

(4.46) 

rxAy 



fk(x,t)f k (y,t)dt 

ixVy 

each vanish in the limit. Note also that the action of T on the biprocess Vt-Ffc 
is, as in the classical case, to restrict stochastic integrals to the interval [0,t]. 

(4.47) TV t F k = J f k (t,t 2 )l t2 < t l®dS t2 + J fkih^tt^tdSt,®!. 

The present symmetry assumptions on f k , imply that (VF k )* = VF k . Pro- 
ceeding with calculations like those in the proof of Corollary 1.12, using the 
symmetry and L 2 -normalization of we then have 

j TV t F k $(V t F k y dt - 1 ® 1 = J (^J™ f k (t,t 1 )f k (t,t 2 )dt)l®dS tl dS t2 

\ 

fk(t,ti)fk(t,t 2 )dt dS h ®dS h 



(4.48) 

+ / ( / fk(ti,t)f k (t 2 ,t)dt ) dS tl ®dS t2 



ti 



+ lilt ^(h,t)f(t 2 ,t)dtjdS h dS t2 ^l. 
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Using Fubini's theorem, we can calculate that the L 2 norm of each of the 
four terms in equation (4.48) is given by 

t 2 

fk(t,x)f k (t,y) dt 



/ dxdyi 

JRI \Jx 



OO PX 



dx I dy[ I f k (t,x)f k (t,y)dt 

JO \JxVy 

poo py / poo \ 2 

+ / dy dx[ I fk(t,x)f k (t,y)dt 

JO Jo \JxAy 

Hence, if fk^fk^ 0, then equation (4.46) shows that each of these two 
terms vanishes in the limit. This proves the implication (3) ==> (1). 
For the final equivalence, we use the explicit representation 

((TV t F k ,rV t F k )} 

fk(t, v)H v< _ t dv + j fk(ti,t)f k (t 2 ,t)± tl <ttt 2 <t dS tl dS t2 . 

Integrating with respect to t and using equations (4.44) and (4.45) as above 
proves the equivalence (1) <^=^ (4). □ 

Remark 4.9. As demostrated in [8], Theorem 4.12, the quantity 
J ((WtF k ,WtF k )) dt in condition (4) of Theorem 4.8 can be interpreted 
as the "quadratic variation" of an appropriate free Brownian martingale. 
Note that quadratic variations play a crucial role in the original proof of 
Theorem 1.1, as originally given in [29]. 

Remark 4.10. Once again, one might expect that calculations like those 
above would show the equivalence of items (l)-(4) in Theorem 4.8 for any 
order of chaos (higher than 1), as was proved in the classical case in [29]. In 
principle, this may be possible for fully symmetric kernels /, but in orders 
> 3 of Wigner chaos, such kernels span only a tiny subspace of all stochastic 
integrals. Indeed, it is an interesting open question if a counter-example to 
these equivalences can be found in the third chaos; until now, the authors 
have not been able to find one, but suspect that Theorem 4.8 does not 
generally hold in the free context. 

APPENDIX: PROOF OF THEOREM 3.20 

We break the proof into four steps. First we show that it is sufficient to 
consider only those h £ C 2 that arise as Fourier transforms of compactly- 
supported measures, in Lemma A. 11. Next we reduce to those h that are 
Fourier transforms of measures with a smooth, compactly-supported density, 
in Lemma A. 12. In Lemma A. 13, we show (following [32], Theorem 7.26) 
that there is a polynomial approximate identity on any symmetric compact 
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interval. Finally, we use this approximate identity locally to approximate any 
smoothly-arising h by local polynomials on the Fourier side in Lemma A. 14, 
completing the proof. The proof will actually show that a space smaller 
than C^ ,P is appropriately dense: the local polynomials may be assumed to 
live in the Schwartz space <S(R) of rapidly-decaying smooth functions. 

Lemma A. 11. Let h E €2- There exists a sequence of compactly-supported 
complex measures v n such that, setting h n = v n : 

(1) s 2 (hn) -> Mh); 

(2) if \i is any finite measure, then J h n dfj, — > J hd[i. 

Proof. Let h = v where v is a complex measure satisfying J ^ 2 \v\ (d£) < 
00. Let v n {d£,) = l|£|< n ^(d£), and take h n = u^. Then 

/n 
-n 

Since h £ C2, the function £ 1— > £ 2 is in L 1 (|i^|); hence, by the dominated con- 
vergence theorem, the integrals in equation (A.l) converge to J £ 2 |f|(<i£) = 
^(h) as desired. Now, for any 



(A.2) \h n {x)-h{x)\ 



The integrand l|£|> n converges pointwise to and is bounded, so since \u\ 
is a finite measure, the dominated convergence theorem shows that h n —>h 
pointwise. Finally, note also that ||/i n ||z»°° < J \v n \ < j \v\ < 00, and so since \i 
is a finite measure, one more application of the Dominated Convergence 
Theorem shows that J h n d/x—> J h d\x as desired. □ 

Lemma A. 12. Let /i E C2 with h = u for some compactly-supported com- 
plex measure v. There exists a sequence of smooth C-valued functions ip n € 
such that, setting h n = ip n : 

(1) S 2 (K)^S2{h); 

(2) if [i is any finite measure then J h n dfj,—> J hdfi. 

Proof. Let <j> E be a nonnegative smooth compactly supported 
function, such that J <f)(£)d£ = 1. Let <f> n (0 = n<p(^/n). Define ip n = 4> n * v\ 
then i/j n —}i/ weakly. Note that supp<^ n C supp</>. Since v is compactly sup- 
ported, there is thus a single compact interval K that contains the supports 
of ip n for all n along with the support of u\ moreover, the functions ip n cire 
all smooth since <f) n is smooth. Set h n = ^ n . Hence, 



(a.3) j?2{k)= / / £>m)=Mh\ 

JK JK 

where the convergence follows from the weak convergence of \ip n \ to \v\ and 
the continuity of £ 1— > £ 2 on the compact set K. For the second required 
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convergence, we use Fubini's theorem, 

hn(x)(i(dx)= / i> n (x)n(dx) = / fj,(dx) / e^Vn(£)<#; 



(A.4) 



M0<% I e ix ^(dx) = I mMOdt, 



where the application of Fubini's theorem is justified by the fact that the 
function (a;, £) i— > e tx ^ip n {^) is in x d£) since ipn G ^(d^) and \x is a finite 
measure. The function /I is continuous and bounded since \i is finite, and so 
since ip n — > v weakly and suppf/'n C K for each n, 



(A.5) / mMZ)<%= / mMZ)<%^ / £(£M<*0- 

The function (x,£) i— >■ e* x ^ is in L 1 (// x |f|) since both are finite measures, 
and so we may apply Fubini's theorem again to find that 



(A.6) 



K 



p(t)v(dZ) = I u(dO I e ix Sfi{dx) = / fi(dx) / e ix tv{d£) 



u{x)n{dx) 



where the first equality uses the fact that suppi^ C K. Equations (A.4)-(A.6) 
combine to show that J h n dfx—> j hd/i, as required. □ 

Lemma A. 13. Let r > 0. There is a sequence of real polynomials q n such 
that, for any function f continuous on R and equal to outside of [—r,r], 
the functions 



(A.7) f n {x) = J f(x- t)q n (t) dt=(f* (q n t hr , r] ))(x) 

are polynomials that converge uniformly to f on [—r, r] . 

Proof. This is proved in [32], Theorem 7.26, in the case r = 1 with 
polynomials c n (l — x 2 ) n for appropriate normalization constants c n . Rudin 
only states (and uses) the uniform convergence on [0,1], but it is easy to 
check that the proof yields uniform convergence on [—1,1]. Rescaling the 
polynomials 

(A.8) ,„(*) = ^(r 2 -* 2 )" 

gives us the desired result. To be clear: the functions f n in equation (A.7) 
are polynomials due to the following change of variables: 



fn(x)= I f(x-t)q n (t)dt= I f(x-t)q n (t)dt 

(A.9) 

f(t)q n (x + t)dt, 
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where the second equality is justified by the fact that f(x — t) = unless 
t £ [x — r, x + r] . □ 

Lemma A. 14. Let h£C 2 with h = ift for some ip £ C£°. Let K C R be 
a compact interval. There exists a sequence ift n of functions in the Schwartz 
space S(M.) such that the functions h n = ift n are in C 2 and: 

(1) MK)^Mh); 

(2) if fi is a finite measure supported in K then J h n d\x — > J hdfj,. 

Proof. Choose r > sup{|x| : x £ K}. Let (ft S be nonnegative, with 
support contained in [— r, r], such that (ft{x) = 1 for x £ K (which is possible 
since K is strictly contained in [— r, r]). For convenience, set p n = q n ^-[- r ,r] 
where q n is the Bernstein polynomial of equation (A. 8). Define 

(A.10) ^ n = ift - $■ cft 2 Y + [[(fa) * Pn ] ■ </>] V . 

Note: for a Schwartz function 7 £ 5(R), the function 7 V = 7 denotes the 
inverse Fourier transform of 7, 



Since iftcft £ C£° , the convolution with p n is well defined and C°° ; cutting off 
with eft again yields a function, and so the inverse Fourier transform is 
a Schwartz function. Similarly, (ft 2 is and ift £ 5(R), so their product is 
a Schwartz function, as is its inverse Fourier transform. Thus, ip n £ 5(R). 
Now we compute 

fa = i> - i) ■ (ft 2 + [(fa) *p n ].0 = 4,.(l-(ft 2 ) + [(fa) * p n ] ■ (ft. 

Since (ft(x) 2 = 1 for x € K, we have ift n (x) = [(ift<ft) *Pn](x) for x & K. Since 
the function / = ip(ft is continuous and equal to outside of [— r, r], equa- 
tions (A. 7) and (A. 9) show that ift n is a polynomial on K. Moreover, ift n is 
rapidly decaying and smooth, so J ^ 2 \ift n (0 \ d£ < 00. Thus h n = ift n £ C 2 ' 
as required. We must now verify conditions (1) and (2) of the lemma. 
First, we compute that 

(a.ii) ^ n ( x )-^( x ) = ±- [ e-* x [[(fa)*p r m-kom\m<it 



2vr „ 

Following this we make the straightforward estimate 

\m*)-m*)\<^ [ \[(^)*Pnm-mm\mdc 

(A.12) 

~ 27 



m<ft)*Pn](o-mmm)dc- 
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Lemma A. 13 shows that (ip(ft) * p n converges to uniformly on [— r, r]. 
Hence, since the integrand in equation (A. 12) converges to uniformly on 
the (compact) domain of integration, it follows that ip n (x) — > tp(x) for each x. 

We must now show that J?2(h n ) — > -^(h) (recall that h n = ip n and h = ijj). 
This will follow from the stronger claim that J?2(h n — h)—*Q, which we now 
show to be true. We compute as follows. 

MK-h) = j \MO-m\Z 2 dt = j gniOjf^, 

where g n (C) = £ 2 (1 + £ 2 )l^n(£) — "0(01- We make this transformation so we 
can use the finite measure v(d^) = d^/{\ + £ 2 ) in the following estimates. 
Since ip n — > tp pointwise, it follows that g n — > pointwise. In order to use 
a uniform integrability condition, we wish to bound the L 2 (u)-norm of g n . 
To that end, we compute 

(A.13) \\g n \\h {v) = J g n (0Mdt) = J IMO-miYil + ff-jf^- 

Now, referring to equation (A. 11), ip n — ip = & n where $ n = [(ip(f>) *p n ] ■ 4> — 
ip(j) 2 . Simplifying equation (A.13) yields 

\\9n\\l Hv) = J \M0\ 2 t\i + f)dt< I \M0\ 2 fa + f) 2 dC 

m+z 2 )Mn)\ 2 da. 



Since £ fe tf„(£) = H) fc (^« } ) v (6 for k £ N, this simplifies to 



\\gn\\h {v) < J IK) v (0 + (C) v (0r^- 

That is, \\g n \\ L 2 {v) < ||«J V + (C) V Hl 2 (R) < \\(K) V \\lHr) + \\(K) V \\l* { r) = 
ll^nlli 2 (K) + \Wn IU 2 (M)) where we have used Parseval's identity in the last 
equality. We now must compute some derivatives. Using the fact that (7 * 
pY = y *p whenever 7 and p are functions whose convolution is well defined 
and 7 is C , we have 

(A.14) < = ({U)'*Pn) ■ i> + {{U) *Pn) ■ $ ~ 

< = {{Uf * Pn )-4> + 3((^)" * Pn ) ■ </>' + 3((^>)' * Pn ) ■ </>" 

(A.15) 

+((^)*p„)-0'"- w 2 y. 

The functions ifxfr and i^cf) 2 are both in C£°, and so there is a constant A 
so that ||(V^)( fc )|| L 2 (R) < A and \\{^ 2 ) {k) \\L^R) < A for < k < 3. Since 
G C^°, there is a constant B so that ||z,°°(R) < B for < k < 3. Us- 
ing Young's convolution inequality ||7 * pIIl2(r) < ||7|U 2 (R)IMIli(I0> ec l ua - 
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tion (A. 14) gives us 

||tfnlU 2 (R) < B \\$<f>Y * Pn\\&{M) + B\\ $(/))* p n \\ L 2 {R) + A 

< B\\(i>^)' W^^WpnWLi^) +b||(^)||l2(r)||p„||li(k) + a 

<BA + BA + A, 

where we use the normalization Hz, 1 (M) = 1- A similar calculation using 
equation (A. 15) shows that 

||<|| i2(M) <8iL4 + A 

Hence, we have 

(A.16) sup \\g n \\mv) < sup(|K|| £ 2 (R) + |K'||l2 (r) ) < WBA + 2A<oo. 

n n 

This allows us to conclude the proof as follows. For any M > 0, we have 
J?2(K-h)= g n dv= g n H<g n <Mdv+ \ g n l gn>M dv. 



The first integrand is bounded above by M, and since v is a finite measure, 
the constant M is in L 1 (u). Hence, since we have already shown that g n — > 
pointwise, we conclude that the first integral converges to using the 
dominated convergence theorem. For the second integral, notice that on the 
domain {g n > M} the function g n /M is > 1, and so 

J gnt 9n>M dv< J 9n~ t gn>M dv<^ J gldv sup \\g n \\ 2 L 2 {v) . 

Since this is true for any M, by taking M — > oo while n — > oo we have 
^2{h n — h) — > as desired. 

Finally, since p is supported in K and (f> = 1 on K, 

h n diJ,= ^ n dp= I (I - (f) 2 )^ dp + I ((V^>) *p n ) ■ cf)dn 
Jk Jk Jk 



{ip(j))*p n dp,. 

'K 

By construction *Pn ip4> (uniformly) on K, and also IKV^) *Pn||L°° < 
llPnlli = H^lli 00 < oo. Since p is a finite measure, the dominated 
convergence theorem therefore shows that 

h n dfi= / (ip<f>) * p n dp — >■ / , 4)<j)dp= j ipdfi= hdp. 
Jk Jk Jk J 

This completes the proof. □ 
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